SEQUENCE LISTING 
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Rouviere, Pierre 
Cheng, Qiong 

<120> Increasing Carotenoid Production in Bacteria via Chromosomal 
Integration 

<130> CL2027 US NA 

<150> US 60/434618 
<151> 2002-12-19 

<160> 66 

<170> Patentin version 3.2 

<210> 1 

<211> 912 

<212> DNA 

<213> Pantoea stewartii 



<220> 

<221> mi sc_feature 

<222> CD . . C3) 

<223> ttg alternative start codon used to encode methionine 

<400> 1 



ttgacggtct 


gcgcaaaaaa 


acacgttcac 


cttactggca 


tttcggctga 


gcagttgctg 


60 


gctgatatcg 


atagccgcct 


tgatcagtta 


ctgccggttc 


agggtgagcg 


ggattgtgtg 


120 


ggtgccgcga 


tgcgtgaagg 


cacgctggca 


ccgggcaaac 


gtattcgtcc 


gatgctgctg 


180 


ttattaacag 


cgcgcgatct 


tggctgtgcg 


atcagtcacg 


ggggattact 


ggatttagcc 


240 


tgcgcggttg 


aaatggtgca 


tgctgcctcg 


ctgattctgg 


atgatatgcc 


ctgcatggac 


300 


gatgcgcaga 


tgcgtcgggg 


gcgtcccacc 


attcacacgc 


agtacggtga 


acatgtggcg 


360 


attctggcgg 


cggtcgcttt 


actcagcaaa 


gcgtttgggg 


tgattgccga 


ggctgaaggt 


420 


ctgacgccga 


tagccaaaac 


tcgcgcggtg 


tcggagctgt 


ccactgcgat 


tggcatgcag 


480 


ggtctggttc 


agggccagtt 


taaggacctc 


tcggaaggcg 


ataaaccccg 


cagcgccgat 


540 


gccatactgc 


taaccaatca 


gtttaaaacc 


agcacgctgt 


tttgcgcgtc 


aacgcaaatg 


600 


gcgtccattg 


cggccaacgc 


gtcctgcgaa 


gcgcgtgaga 


acctgcatcg 


tttctcgctc 


660 


gatctcggcc 


aggcctttca 


gttgcttgac 


gatcttaccg 


atggcatgac 


cgataccggc 


720 


aaagacatca 


atcaggatgc 


aggtaaatca 


acgctggtca 


atttattagg 


ctcaggcgcg 


780 


gtcgaagaac 


gcctgcgaca 


gcatttgcgc 


ctggccagtg 


aacacctttc 


cgcggcatgc 


840 


caaaacggcc 


attccaccac 


ccaacttttt 


attcaggcct 


ggtttgacaa 


aaaactcgct 


900 


gccgtcagtt 


aa 










912 



<210> 2 

<211> 303 

<212> PRT 

<213> Pantoea stewartii 
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<400> 2 

Met Thr val cys Ala Lys Lys His val His Leu Thr Gly lie ser Ala 
1 5 10 15 

Glu Gin Leu Leu Ala Asp lie Asp Ser Arg Leu Asp Gin Leu Leu Pro 
20 25 30 

val Gin Gly Glu Arg Asp Cys Val Gly Ala Ala Met Arg Glu Gly Thr 
35 40 45 

Leu Ala Pro Gly Lys Arg lie Arg Pro Met Leu Leu Leu Leu Thr Ala 
50 55 60 

Arg Asp Leu Gly cys Ala lie ser His Gly Gly Leu Leu Asp Leu Ala 
65 70 75 80 

Cys Ala Val Glu Met val His Ala Ala Ser Leu lie Leu Asp Asp Met 
85 90 95 

Pro Cys Met Asp Asp Ala Gin Met Arg Arg Gly Arg Pro Thr lie His 
100 105 110 

Thr Gin Tyr Gly Glu His Val Ala He Leu Ala Ala Val Ala Leu Leu 
115 120 125 

Ser Lys Ala Phe Gly val He Ala Glu Ala Glu Gly Leu Thr Pro lie 
130 135 140 

Ala Lys Thr Arg Ala Val Ser Glu Leu Ser Thr Ala lie Gly Met Gin 
145 150 155 160 

Gly Leu val Gin Gly Gin Phe Lys Asp Leu Ser Glu Gly Asp Lys Pro 
165 170 175 

Arg Ser Ala Asp Ala lie Leu Leu Thr Asn Gin Phe Lys Thr Ser Thr 
180 185 190 

Leu Phe Cys Ala Ser Thr Gin Met Ala Ser lie Ala Ala Asn Ala Ser 
195 200 205 

Cys Glu Ala Arg Glu Asn Leu His Arg Phe. ser Leu Asp Leu Gly Gin 
210 215 220 

Ala Phe Gin Leu Leu Asp Asp Leu Thr Asp Gly Met Thr Asp Thr Gly 
225 230 235 240 

Lys Asp lie Asn Gin Asp Ala Gly Lys Ser Thr Leu Val Asn Leu Leu 
245 250 255 

Gly ser Gly Ala val Glu Glu Arg Leu Arg Gin His Leu Arg Leu Ala 
260 265 270 
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Ser Glu His Leu ser Ala Ala Cys Gin Asn Gly His Ser Thr Thr Gin 
275 280 285 



Leu Phe lie Gin Ala Trp Phe Asp Lys Lys Leu Ala Ala val Ser 
290 295 300 



<210> 3 

<211> 1296 

<212> DNA 

<213> Pantoea stewartii 



<220> 

<221> CDS 

<222> (1) . . (1296) 

<400> 3 

atg age cat ttt gcg gtq ate gca ccg ccc ttt ttc age cat gtt cgc 48 

Met ser His Phe Ala val lie Ala Pro Pro Phe Phe ser His val Arg 
15 10 15 

get etg caa aac ett get eag gaa tta gtq gee cgc gqt cat cgt gtt 96 
Ala Leu Gin Asn Leu Ala Gin Glu Leu val Ala Arg Gly His Arg val 
20 25 30 

aeg ttt ttt eag caa cat gae tgc aaa gcg etg gta aeg ggc age gat 144 
Thr Phe Phe Gin Gin His Asp cys Lys Ala Leu val Thr Gly Ser Asp 
35 40 45 

ate gqa tte eag aee gte gqa etg caa aeg cat cet ccc gqt tec tta 192 
lie Gly Phe Gin Thr Val Gly Leu Gin Thr His Pro Pro Gly Ser Leu 
50 55 60 

teg cac etg etg cac etg gee gcg cae cea etc gga ccc teg atg tta 240 
Ser His Leu Leu His Leu Ala Ala His Pro Leu Gly Pro ser Met Leu 
65 70 75 80 

ega etg ate aat gaa atg gca cgt aee age gat atg ett tgc egg gaa 288 
Arg Leu He Asn Glu Met Ala Arg Thr ser Asp Met Leu Cys Arg Glu 
85 90 95 

etg ccc gee get ttt eat gcg ttg eag ata gag ggc gtq ate gtt gat 336 
Leu Pro Ala Ala Phe His Ala Leu Gin lie Glu Gly Val lie val Asp 
100 105 110 

caa atg gag eeg gca gqt gca gta gte gca gaa gcg tea gqt etg ccg 384 
Gin Met Glu Pro Ala Gly Ala val val Ala Glu Ala Ser Gly Leu Pro 
115 120 125 

ttt gtt teg gtq gee tgc gcg etg eeg etc aae ege gaa eeg gqt ttg 432 
Phe val Ser vaT Ala Cys Ala Leu Pro Leu Asn Arg Glu Pro Gly Leu 
130 135 140 

cet etg gcg gtq atg cet ttc gag tae ggc ace age. gat gcg get egg 480 
Pro Leu Ala val Met Pro Phe Glu Tyr Gly Thr Ser Asp Ala Ala Arg 
145 150 155 160 

gaa cgc tat ace ace age gaa aaa att tat gae tgg etg atg ega cgt 528 
Glu Arg Tyr Thr Thr ser Glu Lys lie Tyr Asp Trp Leu Met Arg Arg 
165 170 175 

cac gat cgt gtg ate gcg cat cat gca tgc aga atg ggt tta gee eeg 576 
His Asp Arg val lie Ala His His Ala Cys Arg Met Gly Leu Ala Pro 
180 185 190 

cgt gaa aaa etg eat cat tgt ttt tct cea etg gca caa ate age eag 624 
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Arg Glu Lys Leu His His Cys Phe Ser Pro Leu Ala Gin lie Sen Gin 

195 200 205 

ttg ate ccc gaa ctg gat ttt ccc cgc aaa gcg ctg cca gac tgc ttt 672 

Leu lie Pro Glu Leu Asp Phe Pro Arg Lys Ala Leu Pro Asp cys Phe 

210 215 220 

cat gcg gtt gqa ccg tta egg caa ccc cag gqg acg ccg ggg tea tea 720 

His Ala Val Gly Pro Leu Arg Gin Pro Gin Gly Thr Pro Gly ser ser 

225 230 235 240 

act tct tat ttt ccg tec ccg gac aaa ccc cgt att ttt gee teg ctg 768 

Thr Ser Tyr Phe Pro ser Pro Asp Lys Pro Arg lie Phe Ala ser Leu 

245 250 255 

ggc ace ctg cag gqa eat cgt tat ggc ctg ttc agg ace ate gee aaa 816 

Gly Thr Leu Gin Gly His Arg Tyr Gly Leu Phe Arg Thr lie Ala Lys 
260 265 270 

gee tgc gaa gag gtq gat gcg cag tta ctg ttg gea cac tgt ggc ggc 

Ala Cys Glu Glu Val Asp Ala Gin Leu Leu Leu Ala His cys Gly Gly 

275 280 285 



864 



etc tea gee acg cag gea ggt gaa ctg gee egg ggc ggg gac att cag 912 

Leu ser Ala Thr Gin Ala Gly Glu Leu Ala Arg Gly Gly Asp lie Gin 
290 295 300 

gtt gtq gat ttt gee gat caa tec gea gea ett tea cag gea cag ttg 960 

val val Asp Phe Ala Asp Gin Ser Ala Ala Leu ser Gin. Ala Gin Leu 

305 310 315 320 

aca ate aca eat ggt ggg atg aat acg gta ctg gac get att get tec 

Thr lie Thr His Gly Gly Met Asn Thr val Leu Asp Ala lie Ala Ser 
325 330 335 



1008 



cgc aca ccg eta ctg gcg ctg ccg ctg gea ttt gat caa eet ggc gtg 1056 
Arg Thr Pro Leu Leu Ala Leu Pro Leu Ala Phe Asp Gin Pro Gly Val 
340 345 350 

gea tea cga att gtt tat cat ggc ate ggc aag cgt gcg tct egg ttt 1104 
Ala ser Arg lie val Tyr His Gly lie Gly Lys Arg Ala ser Arg Phe 
355 360 y y ^ 

act ace age cat gcg ctg gcg egg cag att cga teg ctg ctg act aac 1152 
Thr Thr Ser His Ala Leu Ala Arg Gin lie Arg Ser Leu Leu Thr Asn 
370 375 380 

ace gat tac ccg cag cgt atg aca aaa att cag gee gea ttg cgt ctg 1200 
Thr Asp Tyr Pro Gin Arg Met Thr Lys lie Gin Ala Ala Leu Arg Leu 
385 390 395 400 

gea ggc ggc aca eca gee gee gee gat att gtt gaa cag gcg atg egg 1248 
Ala Gly Gly Thr Pro Ala Ala Ala Asp lie Val Glu Gin Ala Met Arg 
405 410 415 

ace tgt cag eca gta etc agt ggg cag gat tat gea ace gea eta tga 1296 
Thr cys Gin Pro val Leu Ser Gly Gin Asp Tyr Ala Thr Ala Leu 
420 425 430 

<210> 4 
<211> 431 
<212> PRT 

<213> Pantoea stewartii 
<400> 4 

Met ser His Phe Ala val lie Ala Pro Pro Phe Phe Ser His val Arg 
15 10 15 
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Ala Leu Gin Asn Leu Ala Gin Glu Leu val Ala Arg Gly His Arg val 
20 25 30 

Thr Phe Phe Gin Gin His Asp Cys Lys Ala Leu val Thr Gly sen Asp 
35 40 45 

lie Gly Phe Gin Thr Val Gly Leu Gin Thr His Pro Pro Gly Ser Leu 
50 55 60 

Ser His Leu Leu His Leu Ala Ala His Pro Leu Gly Pro Ser Met Leu 
65 70 75 80 

Arg Leu lie Asn Glu Met Ala Arg Thr Ser Asp Met Leu Cys Arg Glu 
85 90 95 

Leu Pro Ala Ala Phe His Ala Leu Gin lie Glu Gly val lie val Asp 
100 105 110 

Gin Met Glu Pro Ala Gly Ala val Val Ala Glu Ala ser Gly Leu Pro 
115 120 125 

Phe Val ser val Ala cys Ala Leu Pro Leu Asn Arg Glu Pro Gly Leu 
130 135 140 

Pro Leu Ala Val Met Pro Phe Glu Tyr Gly Thr Ser Asp Ala Ala Arg 
145 150 155 160 

Glu Arg Tyr Thr Thr ser Glu Lys lie Tyr Asp Trp Leu Met Arg Arg 
165 170 175 

His Asp Arg Val lie Ala His His Ala Cys Arg Met Gly Leu Ala Pro 
180 185 190 

Arg Glu Lys Leu His His Cys Phe Ser Pro Leu Ala Gin lie Ser Gin 
195 200 205 

Leu lie Pro Glu Leu Asp Phe Pro Arg Lys Ala Leu Pro Asp Cys Phe 
210 215 220 

His Ala Val Gly Pro Leu Arg Gin Pro Gin Gly Thr Pro Gly Ser ser 
225 230 235 240 

Thr Ser Tyr Phe Pro ser Pro Asp Lys Pro Arg He Phe Ala Ser Leu 
245 250 255 

Gly Thr Leu Gin Gly His Arg Tyr Gly Leu Phe Arg Thr lie Ala Lys 
260 265 270 

Ala cys Glu Glu val Asp Ala Gin Leu Leu Leu Ala His Cys Gly Gly 
275 280 285 
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Leu ser Ala Thr Gin Ala Gly Glu Leu Ala Arg Gly Gly Asp lie Gin 
290 295 300 



val Val Asp Phe Ala Asp Gin ser Ala Ala Leu ser Gin Ala Gin Leu 
305 310 315 320 

Thr lie Thr His Gly Gly Met Asn Thr Val Leu Asp Ala lie Ala Ser 
325 330 335 

Arg Thr Pro Leu Leu Ala Leu Pro Leu Ala Phe Asp Gin Pro Gly val 
340 345 350 

Ala Ser Arg lie Val Tyr His Gly lie Gly Lys Arg Ala ser Arg Phe 
355 360 365 

Thr Thr Ser His Ala Leu Ala Arg Gin lie Arg ser Leu Leu Thr Asn 
370 375 380 

Thr Asp Tyr Pro Gin Arg Met Thr Lys lie Gin Ala Ala Leu Arg Leu 
385 390 395 400 

Ala Gly Gly Thr Pro Ala Ala Ala Asp lie val Glu Gin Ala Met Arg 
405 410 415 

Thr cys Gin Pro val Leu Ser Gly Gin Asp Tyr Ala Thr Ala Leu 
420 425 430 

<210> 5 
<211> 1149 
<212> DNA 

<213> Pantoea stewartii 

<220> 

<221> CDS 

<222> (1) . . (1149) 

<400> 5 

atg caa ccg cac tat gat etc att ctg gtc gqt gcc gqt ctg get aat 
Met Gin Pro His Tyr Asp Leu lie Leu val Gly Ala Gly Leu Ala Asn 
15 10 15 



ctt gtg gtc cat cac tgg ccc gac tac cag gtt cgt ttc ccc caa cgc 
Leu val val His His Trp Pro Asp Tyr Gin val Arg Phe Pro Gin Arg 
65 70 75 80 



48 



ggc ctt ate gcg etc egg ctt cag caa cag cat ccg gat atg egg ate 96 
Gly Leu lie Ala Leu Arg Leu Gin Gin Gin His Pro Asp Met Arg lie 
20 25 30 

ttg ctt att gag gcg gqt ect gag gcg gqa gqg aac cat ace tgg tee 144 
Leu Leu lie Glu Ala Gly Pro Glu Ala Gly Gly Asn His Thr Trp Ser 
35 40 45 

ttt cac gaa gag gat tta aeg ctg aat cag cat cgc tgg ata gcg ccg 192 
Phe His Glu Glu Asp Leu Thr Leu Asn Gin His Arg Trp lie Ala Pro 
50 55 60 



240 



cgt cgc cat gtg aac agt ggc tac tac tge gtg ace tec egg cat ttc 288 
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Arq Arg His val Asn ser Gly Tyr Tyr cys val Thr ser Arg His Phe 

85 90 95 

gcc gqg ata etc egg caa cag ttt gqa caa cat tta tgg ctg cat acc 336 

Ala Gly lie Leu Arg Gin Gin Phe Gly Gin His Leu Trp Leu His Thr 

100 105 110 

gcg gtt tea gcc gtt cat get gaa teg gte cag tta geg gat gqc egg 384 

Ala val ser Ala val His Ala Glu ser Val Gin Leu Ala Asp Gly Arg 

115 120 125 

att att eat gee agt aca gtq ate gae gqa egg gqt tac acg cct gat 432 

lie lie His Ala Ser Thr Val lie Asp Gly Arg Gly Tyr Thr Pro Asp 
130 135 140 

tet gca eta cge gta gqa ttc cag gea ttt ate gqt cag gag tgg caa 480 

ser Ala Leu Arg val Gly Phe Gin Ala Phe lie Gly Gin Glu Trp Gin 

145 150 155 160 

ctg age gcg ccg cat gqt tta teg tea ccg att ate atg gat gcg acg 528 

Leu ser Ala Pro His Gly Leu Ser Ser Pro lie lie Met Asp Ala Thr 

165 170 175 

gte gat cag caa aat gqc tac cge ttt gtt tat acc ctg ccg ett tec 576 

Val Asp Gin Gin Asn Gly Tyr Arg Phe val Tyr Thr Leu Pro Leu Ser 

180 185 190 

gca acc gca ctg ctg ate gaa gae aca cae tac att gae aag get aat 624 

Ala Thr Ala Leu Leu lie Glu Asp Thr His Tyr lie Asp Lys Ala Asn 

195 200 205 

ett cag gcc gaa egg gcg egt cag aac att cge gat tat get gcg cga 672 

Leu Gin Ala Glu Arg Ala Arg Gin Asn lie Arg Asp Tyr Ala Ala Arg 
210 215 220 

cag gqt tgg ccg tta cag acg ttg ctg egg gaa gaa cag gqt gca ttg 720 

Gin Gly Trp Pro Leu Gin Thr Leu Leu Arg Glu Glu Gin Gly Ala Leu 

225 230 235 240 

ccc att acg tta acg gqc gat aat cgt cag ttt tgg caa cag caa ccg 768 

Pro lie Thr Leu Thr Gly Asp Asn Arg Gin Phe Trp Gin Gin Gin Pro 

245 250 255 

caa gcc tgt age gqa tta cge gee gqg ctg ttt cat ccg aca ace gqc 816 

Gin Ala Cys Ser Gly Leu Arg Ala Gly Leu Phe His Pro Thr Thr Gly 

260 265 270 

tac tec eta ccg etc gcg gtq gcg ctg gcc gat egt etc age gcg ctg 864 

Tyr ser Leu Pro Leu Ala Val Ala Leu Ala Asp Arg Leu ser Ala Leu 

275 280 285 

gat gtq ttt ace tet tec tet gtt cae cag acg att get cae ttt gcc 912 

Asp Val Phe Thr ser Ser Ser val His Gin Thr lie Ala His Phe Ala 
290 295 300 

cag caa cgt tgg cag caa cag gqg ttt ttc cge atg ctg aat cge atg 960 

Gin Gin Arg Trp Gin Gin Gin Gly Phe Phe Arg Met Leu Asn Arg Met 

305 310 315 32t) 



ttg ttt tta gee gqa ccg gee gag tea cge tgg cgt gtq atg cag cgt 
Leu Phe Leu Ala Gly Pro Ala Glu Ser Arg Trp Arg val Met Gin Arg 
325 330 335 



1008 



ttc tat ggc tta eec gag gat ttg att gee cge ttt tat gcg gqa aaa 1056 
Phe Tyr Gly Leu Pro Glu Asp Leu lie Ala Arg Phe Tyr Ala Gly Lys 
340 345 350 

etc acc gtq acc gat egg eta cge att ctg age gqc aag ccg ccc gtt 1104 
Leu Thr val Thr Asp Arg Leu Arg lie Leu ser Gly Lys Pro Pro Val 
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355 360 365 

ccc gtt ttc gcg gca ttg cag gca att atg acg act cat cgt tga 1149 
Pro val Phe Ala Ala Leu Gin Ala lie Met Thr Thr His Arg 
370 375 380 

<210> 6 
<211> 382 
<212> PRT 

<213> Pantoea stewartii 
<400> 6 

Met Gin Pro His Tyr Asp Leu lie Leu val Gly Ala Gly Leu Ala Asn 
15 10 15 

Gly Leu lie Ala Leu Arg Leu Gin Gin Gin His Pro Asp Met Arg lie 
20 25 30 

Leu Leu lie Glu Ala Gly Pro Glu Ala Gly Gly Asn His Thr Trp Ser 
35 40 45 

Phe His Glu Glu Asp Leu Thr Leu Asn Gin His Arg Trp lie Ala Pro 
50 55 60 

Leu val val His His Trp Pro Asp Tyr Gin val Arg Phe Pro Gin Arg 
65 70 75 80 

Arg Arg His val Asn Ser Gly Tyr Tyr Cys Val Thr Ser Arg His Phe 
85 90 95 

Ala Gly lie Leu Arg Gin Gin Phe Gly Gin His Leu Trp Leu His Thr 
100 105 110 

Ala val ser Ala val His Ala Glu ser val Gin Leu Ala Asp Gly Arg 
115 120 125 

lie lie His Ala ser Thr Val lie Asp Gly Arg Gly Tyr Thr Pro Asp 
130 135 140 

Ser Ala Leu Arg val Gly Phe Gin Ala Phe lie Gly Gin Glu Trp Gin 
145 150 155 160 

Leu ser Ala Pro His Gly Leu Ser ser Pro lie lie Met Asp Ala Thr 
165 170 175 

val Asp Gin Gin Asn Gly Tyr Arg Phe Val Tyr Thr Leu Pro Leu ser 
180 185 190 

Ala Thr Ala Leu Leu lie Glu Asp Thr His Tyr He Asp Lys Ala Asn 
195 200 205 

Leu Gin Ala Glu Arg Ala Arg Gin Asn lie Arg Asp. Tyr Ala Ala Arg 
210 215 220 
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Gin Gly Trp Pro Leu Gin Thr Leu Leu Arg Glu Glu Gin Gly Ala Leu 
225 230 235 240 



Pro lie Thr Leu Thr Gly Asp Asn Arg Gin Phe Trp Gin Gin Gin Pro 
245 250 255 

Gin Ala Cys ser Gly Leu Arg Ala Gly Leu Phe His Pro Thr Thr Gly 
260 265 270 

Tyr Ser Leu Pro Leu Ala val Ala Leu Ala Asp Arg Leu Ser Ala Leu 
275 280 285 

Asp Val Phe Thr ser ser Ser val His Gin Thr lie Ala His Phe Ala 
290 295 300 

Gin Gin Arg Trp Gin Gin Gin Gly Phe Phe Arg Met Leu Asn Arg Met 
305 310 315 320 

Leu Phe Leu Ala Gly Pro Ala Glu Ser Arg Trp Arg val Met Gin Arg 
325 330 335 

Phe Tyr Gly Leu Pro Glu Asp Leu lie Ala Arg Phe Tyr Ala Gly Lys 
340 345 350 

Leu Thr Val Thr Asp Arg Leu Arg lie Leu Ser Gly Lys Pro Pro Val 
355 360 365 

Pro val Phe Ala Ala Leu Gin Ala lie Met Thr Thr His Arg 
370 375 380 

<210> 7 

<211> 1479 

<212> DNA 

<213> Pantoea stewartii 

<220> 

<221> CDS 

<222> (1) . . (1479) 

<400> 7 

atg aaa cca act acg gta att ggt gcg ggc ttt gqt ggc ctg gca ctg 48 

Met Lys Pro Thr Thr val lie Gly Ala Gly Phe Gly Gly Leu Ala Leu 

1 5 10 15 

gca att cgt tta cag gcc gca g^t att cct gtt ttg ctg ctt gag cag 96 



Ala He Arg Leu Gin Ala Ala Gly lie Pro val Leu Leu Leu Glu Gin 

20 25 30 

cgc gac aag ccg ggt ggc egg get tat gtt tat cag gag cag ggc ttt 

Arg Asp Lys Pro Gly Gly Arg Ala Tyr val Tyr Gin Glu Gin Gly Phe 

35 40 45 



144 



act ttt gat gca ggc cct acc gtt ate acc gat ccc age gcg att gaa 192 
Thr Phe Asp Ala Gly Pro Thr val lie Thr Asp Pro Ser Ala lie Glu 
50 55 60 

gaa ctg ttt get ctg gee ggt aaa cag ctt aag gat tac gtc gag ctg 240 
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Glu Leu Phe Ala Leu Ala Gly Lys Gin Leu Lys Asp Tyr val Glu Leu 

65 70 75 80 

ttg ccg gtc acg ccg ttt tat cgc ctg tgc tgg gag tec gqc aag gtc 288 

Leu Pro Val Thr Pro Phe Tyr Arg Leu Cys Trp Glu Ser Gly Lys val 

85 90 95 

ttc aat tac gat aac gac cag gcc cag tta gaa gcg cag ata cag cag 336 

Phe Asn Tyr Asp Asn Asp Gin Ala Gin Leu Glu Ala Gin He Gin Gin 

100 105 110 

ttt aat ccg cgc gat gtt gcg ggt tat cga gcg ttc ctt gac tat teg 384 

Phe Asn Pro Arg Asp val Ala Gly Tyr Arg Ala Phe Leu Asp Tyr Ser 
115 120 125 

cgt gcc gta ttc aat gag gqc tat ctg aag etc gqc act gtq cct ttt 

Arg Ala Val Phe Asn Glu Gly Tyr Leu Lys Leu Gly Thr val Pro Phe 
130 135 140 



Asn Ala Arg Val Ser His Met Glu Thr val Gly Asp Lys lie Gin Ala 

245 250 255 

gtq cag ttg gaa gac gqc aga egg ttt gaa ace tgc gcg gtq gcg teg 

val Gin Leu Glu Asp Gly Arg Arg Phe Glu Thr cys Ala val Ala Ser 



432 



tta teg ttc aaa gac atg ctt egg gcc gcg ecc cag ttg gea aag ctg 480 

Leu Ser Phe Lys Asp Met Leu Arg Ala Ala Pro Gin Leu Ala Lys Leu 

145 150 155 160 

cag gea tgg cgc age gtt tac agt aaa gtt gee gqc tac att gag gat 528 

Gin Ala Trp Arg Ser val Tyr ser Lys val Ala Gly Tyr lie Glu Asp 
165 170 175 

gag cat ctt egg cag gcg ttt tct ttt cae teg etc tta gtq gqg ggg 576 

Glu His Leu Arg Gin Ala Phe ser Phe His Ser Leu Leu val Gly Gly 
180 185 190 

aat ccg ttt gea ace teg tec att tat acg ctg att cae gcg tta gaa 624 

Asn Pro Phe Ala Thr Ser Ser lie Tyr Thr Leu lie His Ala Leu Glu 
195 200 205 

egg gaa tgg gqc gtc tgg ttt cea cgc ggt gga ace ggt gcg ctg gtc 672 

Arg Glu Trp Gly val Trp Phe Pro Arg Gly Gly Thr Gly Ala Leu val 
210 215 220 

aat gge atg ate aag ctg ttt cag gat ctg ggc ggc gaa gtc gtg ctt 720 

Asn Gly Met lie Lys Leu Phe Gin Asp Leu Gly Gly Glu Val val Leu 

225 230 235 240 

aac gcc egg gtc agt cat atg gaa ace gtt g^g gac aag att cag gcc 768 



816 



aac get gat gtt gta eat ace tat cgc gat ctg ctg tct cag cat ecc 864 

Asn Ala Asp val val His Thr Tyr Arg Asp Leu Leu ser Gin His Pro 
275 280 285 

gea gcc get aag cag gcg aaa aaa ctg caa tec aag cgt atg agt aac 912 

Ala Ala Ala Lys Gin Ala Lys Lys Leu Gin ser Lys Arg Met ser Asn 
290 295 300 

tea ctg ttt gta etc tat ttt ggt etc aac cat cat cae gat caa etc 960 

ser Leu Phe val Leu Tyr Phe Gly Leu Asn His His His Asp Gin Leu 

305 310 315 320 

gee eat cat ace gtc tgt ttt ggg cea cgc tac cgt gaa ctg att cae 1008 

Ala His His Thr val Cys Phe Gly Pro Arg Tyr Arg Glu Leu lie His 

325 330 335 

gaa att ttt aac cat gat ggt ctg get gag gat ttt teg ctt tat tta 1056 

Glu lie Phe Asn His Asp Gly Leu Ala Glu Asp Phe Ser Leu Tyr Leu 
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340 345 350 

cac gca cct tgt gtc acg gat ccg tea ctg gca ccg gaa gqg tgc gqc 1104 
His Ala Pro Cys val Thr Asp Pro Ser Leu Ala Pro Glu Gly cys Gly 

355 360 365 

age tat tat gtq ctg gcg cct gtt cca cac tta gqc acg gcg aac etc 1152 
Ser Tyr Tyr Val Leu Ala Pro val Pro His Leu Gly Thr Ala Asn Leu 
370 375 380 

gac tgg gcg gta gaa gqa ccc cga ctg cgc gat cgt att ttt gac tac 1200 
Asp Trp Ala Val Glu Gly Pro Arg Leu Arg Asp Arg lie Phe Asp Tyr 
385 390 395 400 

ctt gag caa cat tac atg cct gqc ttg cga age cag ttg gtg acg cac 1248 
Leu Glu Gin His Tyr Met Pro Gly Leu Arg Ser Gin Leu Val Thr His 
405 410 415 

cgt atg ttt acg ccg ttc gat ttc cgc gac gag etc aat gee tgg caa 1296 
Arg Met Phe Thr Pro Phe Asp Phe Arg Asp Glu Leu Asn Ala Trp Gin 
420 425 430 

ggt teg gee ttc teg gtt gaa cct att ctg ace cag age gee tgg ttc 1344 
Gly Ser Ala Phe Ser Val Glu Pro lie Leu Thr Gin Ser Ala Trp Phe 
435 440 445 

cga cca eat aac cgc gat aag eac att gat aat ctt tat ctg gtt gqc 1392 
Arg Pro His Asn Arg Asp Lys His He Asp Asn Leu Tyr Leu val Gly 
450 455 460 



gca gqc ace eat eet gqc gcg gqc att eee gqc gta ate gqc teg gcg 
Ala Gly Thr His Pro Gly Ala Gly lie Pro Gly val lie Gly Ser Ala 
465 470 475 480 



1440 



aag gcg acg gca gqc tta atg ctg gag gac ctg att tga 1479 
Lys Ala Thr Ala Gly Leu Met Leu Glu Asp Leu lie 
485 490 

<210> 8 
<211> 492 
<212> PRT 

<213> Pantoea stewartii 
<400> 8 

Met Lys Pro Thr Thr val lie Gly Ala Gly Phe Gly Gly Leu Ala Leu 
15 10 15 

Ala lie Arg Leu Gin Ala Ala Gly lie Pro val Leu Leu Leu Glu Gin 
20 25 30 

Arg Asp Lys Pro Gly Gly Arg Ala Tyr val Tyr Gin Glu Gin Gly Phe 
35 40 45 

Thr Phe Asp Ala Gly Pro Thr val lie Thr Asp Pro Ser Ala lie Glu 
50 55 60 

Glu Leu Phe Ala Leu Ala Gly Lys Gin Leu Lys Asp Tyr val Glu Leu 
65 70 75 80 

Leu Pro val Thr Pro Phe Tyr Arg Leu Cys Trp Glu Ser Gly Lys val 
85 90 95 
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Phe Asn Tyr Asp Asn Asp Gin Ala Gin Leu Glu Ala Gin lie Gin Gin 
100 105 110 

Phe Asn Pro Arg Asp val Ala Gly Tyr Arg Ala Phe Leu Asp Tyr ser 
115 120 125 

Arg Ala val Phe Asn Glu Gly Tyr Leu Lys Leu Gly Thr Val Pro Phe 
130 135 140 

Leu ser Phe Lys Asp Met Leu Arg Ala Ala Pro Gin Leu Ala Lys Leu 
145 150 155 160 

Gin Ala Trp Arg Ser Val Tyr Ser Lys val Ala Gly Tyr lie Glu Asp 
165 170 175 

Glu His Leu Arg Gin Ala Phe Ser Phe His Ser Leu Leu val Gly Gly 
180 185 190 

Asn Pro Phe Ala Thr ser ser lie Tyr Thr Leu lie His Ala Leu Glu 
195 200 205 

Arg Glu Trp Gly Val Trp Phe Pro Arg Gly Gly Thr Gly Ala Leu val 
210 215 220 

Asn Gly Met lie Lys Leu Phe Gin Asp Leu Gly Gly Glu val val Leu 
225 230 235 240 

Asn Ala Arg val Ser His Met Glu Thr val Gly Asp Lys lie Gin Ala 
245 250 255 

Val Gin Leu Glu Asp Gly Arg Arg Phe Glu Thr Cys Ala val Ala ser 
260 265 270 

Asn Ala Asp val val His Thr Tyr Arg Asp Leu Leu ser Gin His Pro 
275 280 285 

Ala Ala Ala Lys Gin Ala Lys Lys Leu Gin ser Lys Arg Met Ser Asn 
290 295 300 

Ser Leu Phe Val Leu Tyr Phe Gly Leu Asn His His His Asp Gin Leu 
305 310 315 320 

Ala His His Thr val Cys Phe Gly Pro Arg Tyr Arg Glu Leu lie His 
325 330 335 

Glu lie Phe Asn His Asp Gly Leu Ala Glu Asp Phe Ser Leu Tyr Leu 
340 345 350 

His Ala Pro cys val Thr Asp Pro Ser Leu Ala Pro Glu Gly Cys Gly 
355 360 365 
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Ser Tyr Tyr val Leu Ala Pro val Pro His Leu Gly 
370 375 380 



Thr Ala Asn Leu 



Asp Trp Ala val Glu Gly Pro Arg Leu Arg Asp Arg lie Phe Asp Tyr 
385 390 395 400 

Leu Glu Gin His Tyr Met Pro Gly Leu Arg Ser Gin Leu val Thr His 
405 410 415 

Arg Met Phe Thr Pro Phe Asp Phe Arg Asp Glu Leu Asn Ala Trp Gin 
420 425 430 

Gly ser Ala Phe Ser Val Glu Pro lie Leu Thr Gin ser Ala Trp Phe 
435 440 445 

Arg Pro His Asn Arg Asp Lys His lie Asp Asn Leu Tyr Leu val Gly 
450 455 460 

Ala Gly Thr His Pro Gly Ala Gly lie Pro Gly val lie Gly Ser Ala 
465 470 475 480 

Lys Ala Thr Ala Gly Leu Met Leu Glu Asp Leu lie 
485 490 

<210> 9 
<211> 891 
<212> DNA 

<213> Pantoea stewartii 
<220> 

<221> CDS 

<222> (1) . . (891) 

<400> 9 

atg gcg gtt ggc teg aaa age ttt gcg act gca teg acg ctt ttc gac 48 

Met Ala Val Gly ser Lys ser Phe Ala Thr Ala ser Thr Leu Phe Asp 
15 10 15 

gcc aaa acc cgt cgc age gtg ctg atg ctt tac gca tgg tgc cgc cac 96 
Ala Lys Thr Arg Arg Ser val Leu Met Leu Tyr Ala Trp Cys Arg His 
20 25 30 

tgc gac gac gtc att gac gat caa aca ctg ggc ttt cat gcc gac cag 144 
Cys Asp Asp val lie Asp Asp Gin Thr Leu Gly Phe His Ala Asp Gin 
35 40 45 

ccc tct teg cag atg cct gag cag cgc ctg cag cag ctt gaa atg aaa 192 
Pro Ser Ser Gin Met Pro Glu Gin Arg Leu Gin Gin Leu Glu Met Lys 
50 55 60 

acg cgt cag gcc tac gcc ggt teg caa atg cac gag ccc get ttt gee 240 
Thr Arg Gin Ala Tyr Ala Gly Ser Gin Met His Glu Pro Ala Phe Ala 
65 70 75 80 

gcg ttt cag gag gtc gcg atg gcg cat gat ate get ccc gcc tac gcg 288 
Ala Phe Gin Glu val Ala Met Ala His Asp lie Ala Pro Ala Tyr Ala 
85 90 95 

ttc gac cat ctg gaa ggt ttt gee atg gat gtg cgc gaa acg cgc tac 336 
Phe Asp His Leu Glu Gly Phe Ala Met Asp val Arg Glu Thr Arg Tyr 
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100 105 110 

ctg aca ctg gac gat acg ctg cgt tat tgc tat cac gtc gcc ggt gtt 384 
Leu Thr Leu Asp Asp Thr Leu Arg Tyr cys Tyr His val Ala Gly val 
115 120 125 

gtg ggc ctg atg atg gcg caa att atg ggc gtt cgc gat aac gcc acg 432 
val Gly Leu Met Met Ala Gin lie Met Gly Val Arg Asp Asn Ala Thr 
130 135 140 

etc gat cgc gcc tgc gat etc ggg ctg get ttc cag ttg acc aac att 480 
Leu Asp Arg Ala Cys Asp Leu Gly Leu Ala Phe Gin Leu Thr Asn lie 
145 150 155 160 

gcg cgt gat att gtc gac gat get cag gtg ggc cgc tgt tat ctg cet 528 
Ala Arg Asp lie val Asp Asp Ala Gin val Gly Arg Cys Tyr Leu Pro 
165 170 175 

gaa age tgg ctg gaa gag gaa gga ctg acg aaa gcg aat tat get gcg 576 
Glu Ser Trp Leu Glu Glu Glu Gly Leu Thr Lys Ala Asn Tyr Ala Ala 
180 185 190 

cea gaa aac egg cag gee tta age cgt ate gee ggg ega etg gta egg 624 
Pro Glu Asn Arg Gin Ala Leu ser Arg lie Ala Gly Arg Leu val Arg 
195 200 205 

gaa gcg gaa eee tat tac gta tea tea atg gee ggt ctg gea caa tta 672 
Glu Ala Glu Pro Tyr Tyr val Ser Ser Met Ala Gly Leu Ala Gin Leu 
210 215 220 

eee tta cgc teg gee tgg gee ate gcg aca gcg aag cag gtg tac cgt 720 
Pro Leu Arg Ser Ala Trp Ala lie Ala Thr Ala Lys Gin val Tyr Arg 
225 230 235 240 

aaa att ggc gtg aaa gtt gaa cag gee ggt aag cag gee tgg gat cat 768 
Lys lie Gly val Lys Val Glu Gin Ala Gly Lys Gin Ala Trp Asp His 
245 250 255 

cgc cag tec acg tec acc gcc gaa aaa tta acg ett ttg etg acg gea 816 
Arg Gin ser Thr ser Thr Ala Glu Lys Leu Thr Leu Leu Leu Thr Ala 
260 265 270 

tec ggt cag gea gtt act tec egg atg aag acg tat cea ecc cgt cet 864 
Ser Gly Gin Ala val Thr ser Arg Met Lys Thr Tyr Pro Pro Arg Pro 
275 280 285 

get cat etc tgg cag cgc ecg ate tag 891 
Ala His Leu Trp Gin Arg Pro lie 
290 295 

<210> 10 
<211> 296 
<212> PRT 

<213> Pantoea stewartii 
<400> 10 

Met Ala Val Gly ser Lys Ser Phe Ala Thr Ala Ser Thr Leu Phe Asp 
15 10 15 

Ala Lys Thr Arg Arg ser val Leu Met Leu Tyr Ala Trp cys Arg His 
20 25 30 

Cys Asp Asp Val lie Asp Asp Gin Thr Leu Gly Phe His Ala Asp Gin 
35 40 45 
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Pro ser ser Gin Met Pro Glu Gin Arg Leu Gin Gin Leu Glu Met Lys 
50 55 60 

Thr Arg Gin Ala Tyr Ala Gly Ser Gin Met His Glu Pro Ala Phe Ala 
65 70 75 80 

Ala Phe Gin Glu val Ala Met Ala His Asp lie Ala Pro Ala Tyr Ala 
85 90 95 

Phe Asp His Leu Glu Gly Phe Ala Met Asp val Arg Glu Thr Arg Tyr 
100 105 110 

Leu Thr Leu Asp Asp Thr Leu Arg Tyr Cys Tyr His val Ala Gly val 
115 120 125 

val Gly Leu Met Met Ala Gin lie Met Gly Val Arg Asp Asn Ala Thr 
130 135 140 

Leu Asp Arg Ala Cys Asp Leu Gly Leu Ala Phe Gin Leu Thr Asn lie 
145 150 155 160 

Ala Arg Asp lie val Asp Asp Ala Gin val Gly Arg cys Tyr Leu Pro 
165 170 175 

Glu Ser Trp Leu Glu Glu Glu Gly Leu Thr Lys Ala Asn Tyr Ala Ala 
180 185 190 

Pro Glu Asn Arg Gin Ala Leu Ser Arg lie Ala Gly Arg Leu val Arg 
195 200 205 

Glu Ala Glu Pro Tyr Tyr Val Ser Ser Met Ala Gly Leu Ala Gin Leu 
210 215 220 

Pro Leu Arg Ser Ala Trp Ala lie Ala Thr Ala Lys Gin val Tyr Arg 
225 230 235 240 

Lys lie Gly val Lys val Glu Gin Ala Gly Lys Gin Ala Trp Asp His 
245 250 255 

Arg Gin ser Thr ser Thr Ala Glu Lys Leu Thr Leu Leu Leu Thr Ala 
260 265 270 

Ser Gly Gin Ala val Thr ser Arg Met Lys Thr Tyr Pro Pro Arg Pro 
275 280 285 

Ala His Leu Trp Gin Arg Pro lie 
290 295 

<210> 11 
<211> 528 
<212> DNA 

<213> Pantoea stewartii 
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<220> 

<221> CDS 

<222> (1)..(528) 



<400> 11 

atg ttg tgg att tgg aat gcc ctg ate gtq ttt gtc acc gtg gtc gqc 
Met Leu Trp lie Trp Asn Ala Leu lie val Phe val Thr val val Gly 



10 15 



48 



atg gaa gtq gtt get gca ctg gca cat aaa tac ate atg eac gqc tgg 96 
Met Glu val val Ala Ala Leu Ala His Lys Tyr lie Met His Gly Trp 
20 25 30 

ggt tgg gqc tgg cat ctt tea eat eat gaa eeg cgt aaa gqc gca ttt 144 
Gly Trp Gly Trp His Leu Ser His His Glu Pro Arg Lys Gly Ala Phe 
35 40 45 

gaa gtt aac gat etc tat gcc gtq gta tte gcc att gtq teg att gcc 192 
Glu val Asn Asp Leu Tyr Ala Val val Phe Ala lie Val Ser He Ala 
50 55 60 

ctg att tac tte gqc agt aca gga ate tgg eeg etc eag tgg att gqt 240 
Leu lie Tyr Phe Gly Ser Thr Gly lie Trp Pro Leu Gin Trp lie Gly 
65 70 75 80 

gca gqc atg acc get tat gqt tta ctg tat ttt atg gtc eac gae gga 288 
Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met val His Asp Gly 
85 90 95 

ctg gta eac eag cge tgg eeg tte ege tac ata eeg egc aaa gge tac 336 
Leu Val His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr 
100 105 110 

ctg aaa egg tta tac atg gee eac cgt atg eat eat get gta agg gga 384 
Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala val Arg Gly 
115 120 125 

aaa gag gge tge gtg tee ttt ggt ttt ctg tac gcg eca eeg tta tet 432 
Lys Glu Gly Cys Val Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu ser 
130 135 140 

aaa ctt eag gcg aeg ctg aga gaa agg eat gcg get aga teg gge get 480 
Lys Leu Gin Ala Thr Leu Arg Glu Arg His Ala Ala Arg ser Gly Ala 
145 150 155 160 

gcc aga gat gag eag gae ggg gtg gat aeg tet tea tec gqg aag taa 528 
Ala Arg Asp Glu Gin Asp Gly Val Asp Thr Ser Ser Ser Gly Lys 
165 170 175 

<210> 12 
<211> 175 
<212> PRT 

<213> Pantoea stewartii 
<400> 12 

Met Leu Trp lie Trp Asn Ala Leu lie Val Phe Val Thr Val val Gly 
15 10 15 

Met Glu val val Ala Ala Leu Ala His Lys Tyr lie Met His Gly Trp 
20 25 30 

Gly Trp Gly Trp His Leu ser His His Glu Pro Arg Lys Gly Ala Phe 
35 40 45 
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Glu val Asn Asp Leu Tyr Ala val val Phe Ala lie val ser lie Ala 
50 55 60 



Leu lie Tyr Phe Gly Ser Thr Gly lie Trp Pro Leu Gin Trp lie Gly 
65 70 75 80 

Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met val His Asp Gly 
85 90 95 

Leu val His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr 
100 105 110 

Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala val Arg Gly 
115 120 125 

Lys Glu Gly Cys Val Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu Ser 
130 135 140 

Lys Leu Gin Ala Thr Leu Arg Glu Arg His Ala Ala Arg ser Gly Ala 
145 150 155 160 

Ala Arg Asp Glu Gin Asp Gly Val Asp Thr ser ser ser Gly Lys 
165 170 175 

<210> 13 
<211> 1860 
<212> DNA 

<213> Methyl omonas 16a 
<220> 

<221> CDS 

<222> (1) . . (1860) 

<400> 13 

atg get ctt tec aaa gac ttc cct eta etc aat tec ate eae ace cca 48 

Met Ala Leu Ser Lys Asp Phe Pro Leu Leu Asn ser lie His Thr Pro 

15 10 15 

geg gac ata cge geg etg tee aag gae eag etc eag caa etg get gac 96 
Ala Asp lie Arg Ala Leu Ser Lys Asp Gin Leu Gin Gin Leu Ala Asp 
20 25 30 

Qcig gtg ege gqe tat etg ace eae aeg gte age att tee gqe gqe cat 144 
Glu val Arg Gly Tyr Leu Thr His Thr val ser lie ser Gly Gly His 
35 40 45 

ttt geg gee gqe etc gqe ace gtg gaa etg ace gtg gee ttg eat tat 
Phe Ala Ala Gly Leu Gly Thr val Glu Leu Thr val Ala Leu His Tyr 
50 55 60 



192 



gtg ttc aat ace cec gte gat eag ttg gte tgg gae gtg gqe eat eag 240 
val Phe Asn Thr Pro val Asp Gin Leu val Trp Asp val Gly His Gin 
65 70 75 80 

gcc tat ccg cac aag att etg ace gqt cgc aag gag cgc atg ccg acc 288 
Ala Tyr Pro His Lys lie Leu Thr Gly Arg Lys Glu Arg Met Pro Thr 
85 90 95 
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att cgc acc ctg gqc ggg gtq tea gcc ttt ccg gcg egg gac gag age 336 
lie Arg Thr Leu Gly Gly val ser Ala Phe Pro Ala Arg Asp Glu Sen 
100 105 110 

gaa tac gat gcc ttc gqc gtc gqc cat tec age ace teg ate age gcg 384 
Glu Tyr Asp Ala Phe Gly Val Gly His Ser Ser Thr Ser lie Ser Ala 
115 120 125 

gca ctg gqc atg gcc att gcg teg cag ctg cgc gqc gaa gac aag aag 432 
Ala Leu Gly Met Ala lie Ala Ser Gin Leu Arg Giy Glu Asp Lys Lys 
130 135 140 

atg gta gee ate ate gqc gac gqt tec ate ace gqc gqc atg gee tat 480 
Met val Ala lie He Gly Asp Gly ser lie Thr Gly Gly Met Ala Tyr 
145 150 155 160 

gag gcg atg aat eat gee gqc gat gtq aat gee aac ctg ctg gtq ate 528 
Glu Ala Met Asn His Ala Gly Asp val Asn Ala Asn Leu Leu val lie 
165 170 175 

ttg aac gac aac gat atg teg ate teg ccg ccg gtc ggg gcg atg aac 576 
Leu Asn Asp Asn Asp Met Ser lie ser Pro Pro val Gly Ala Met Asn 
180 185 190 

aat tat ctg acc aag gtq ttg teg age aag ttt tat teg teg gtq egg 624 
Asn Tyr Leu Thr Lys Val Leu Ser Ser Lys Phe Tyr ser Ser Val Arg 
195 200 205 

gaa gag age aag aaa get ctg gee aag atg ccg teg gtq tgg gaa ctg 672 
Glu Glu Ser Lys Lys Ala Leu Ala Lys Met Pro Ser val Trp Glu Leu 
210 215 220 

gcg cgc aag acc gag gaa eac gtq aag gqc atg ate gtq ccc gqt ace 720 
Ala Arg Lys Thr Glu Glu His val Lys Gly Met lie vaT Pro Gly Thr 
225 230 235 240 

ttg ttc gag gaa ttg gqc ttc aat tat ttc gqc ccg ate gac gqc eat 768 
Leu Phe Glu Glu Leu Gly Phe Asn Tyr Phe Gly Pro lie Asp Giy His 
245 250 255 

gat gtc gag atg ctg gtq teg ace ctg gaa aat ctg aag gat ttg ace 816 
Asp val Glu Met Leu val ser Thr Leu Glu Asn Leu Lys Asp Leu Thr 
260 265 270 

ggg ccg gta ttc ctg eat gtg gtg ace aag aag gge aaa gqc tat gcg 864 
Gly Pro val Phe Leu His val val Thr Lys Lys Gly Lys Gly Tyr Ala 
275 280 285 

cea gcc gag aaa gac ccg ttg gee tac cat gge gtg ccg get ttc gat 912 
Pro Ala Glu Lys Asp Pro Leu Ala Tyr His Gly val Pro Ala Phe Asp 
290 295 300 

ccg ace aag gat ttc ctg ccc aag gcg gcg ccg teg ccg eat ccg acc 960 
Pro Thr Lys Asp Phe Leu Pro Lys Ala Ala Pro Ser Pro His Pro Thr 
305 310 315 320 

tat acc gag gtg ttc gge cgc tgg ctg tgc gac atg gcg get caa gac 1008 
Tyr Thr Glu val Phe Gly Arg Trp Leu Cys Asp Met Ala Ala Gin Asp 
325 330 335 

gag cgc ttg ctg gge ate aeg ccg gcg atg cgc gaa gge tct ggt ttg 1056 
Glu Arg Leu Leu Gly lie Thr Pro Ala Met Arg Glu Gly Ser Gly Leu 
340 345 350 

gtq gaa ttc tea eag aaa ttt ccg aat cgc tat ttc gat gtc gee ate 1104 
Val Glu Phe Ser Gin Lys Phe Pro Asn Arg Tyr Phe Asp val Ala lie 
355 360 365 

gee gag cag cat gcg gtg ace ttg gee gee gge cag gee tgc eag gge 1152 
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Ala Glu Gin His Ala val Thr Leu Ala Ala Gly Gin Ala Cys Gin Gly 
370 375 380 

gcc aag ccg gtq gtq gcg att tat tec acc ttc ctg caa cgc ggt tac 1200 
Ala Lys Pro val val Ala lie Tyr Ser Thr Phe Leu Gin Arg Gly Tyr 
385 390 395 400 

gat cag ttg ate cac gac gtq gcc ttg cag aac tta gat atg etc ttt 1248 
Asp Gin Leu lie His Asp val Ala Leu Gin Asn Leu Asp Met Leu Phe 
405 410 415 

gca ctg gat cgt gcc gqc ttg gtc gqc ccg gat gqa ccg acc cat get 1296 
Ala Leu Asp Arg Ala Gly Leu Val Gly Pro Asp Gly Pro Thr His Ala 
420 425 430 

gqc gcc ttt gat tac age tac atg cgc tgt att ccg aac atg ctg ate 1344 
Gly Ala Phe Asp Tyr Ser Tyr Met Arg Cys lie Pro Asn Met Leu lie 
435 440 445 

atg get eca gcc gac gag aac gag tge agg cag atg ctg ace acc gqc 1392 
Met Ala Pro Ala Asp Glu Asn Glu Cys Arg Gin Met Leu Thr Thr Gly 
450 455 460 

ttc caa cac cat gqc ccg get teg gtq cgc tat ccg cgc gqc aaa ggg 1440 
Phe Gin His His Gly Pro Ala ser val Arg Tyr Pro Arg Gly Lys Gly 
465 470 475 480 

cec ggg gcg gca ate gat ccg acc ctg ace gcg ctg gag ate gqc aag 1488 
Pro Gly Ala Ala lie Asp Pro Thr Leu Thr Ala Leu Glu lie Gly Lys 
485 490 495 

gee gaa gtc aga cac cac gqc age cgc ate gee att ctg gee tgg gqc 1536 
Ala Glu val Arg His His Gly Ser Arg lie Ala lie Leu Ala Trp Gly 
500 505 510 

age atg gtc aeg cet gcc gtc gaa gcc gqc aag cag ctg gqc gcg aeg 1584 
ser Met val Thr Pro Ala val Glu Ala Gly Lys Gin Leu Gly Ala Thr 
515 520 525 

gtq gtq aac atg cgt ttc gtc aag ccg ttc gat caa gcc ttg gtq ctg 1632 
val val Asn Met Arg Phe Val Lys Pro Phe Asp Gin Ala Leu val Leu 
530 535 540 

gaa ttg gcc agg aeg cac gat gtq ttc gtc acc gtc gag gaa aac gtc 1680 
Glu Leu Ala Arg Thr His Asp val Phe val Thr val Glu Glu Asn val 
545 550 555 560 

ate gcc gqc gqc get gqc agt gcg ate aac ace ttc ctg cag gcg cag 1728 
lie Ala Gly Gly Ala Gly Ser Ala lie Asn Thr Phe Leu Gin Ala Gin 
565 570 575 

gtq ctg atg ccg gtc tge aac ate gqc ctg cec gac cgc ttc gtc 1776 
Lys val Leu Met Pro val cys Asn lie Gly Leu Pro Asp Arg Phe val 
580 585 590 

gag caa gqt agt cgc gag gaa ttg etc age ctg gtc gqc etc gac age 1824 
Glu Gin Gly Ser Arg Glu Glu Leu Leu Ser Leu Val Gly Leu Asp ser 
595 600 605 

aag gqc ate ttc gee ace ate gaa cag ttt tge get 1860 
Lys Gly lie Phe Ala Thr lie Glu Gin Phe Cys Ala 
610 615 620 

<210> 14 

<211> 620 

<212> PRT 

<213> Methyl omonas 16a 
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<400> 14 

Met Ala Leu ser Lys Asp Phe Pro Leu Leu Asn ser lie His Thr Pro 
15 10 15 

Ala Asp lie Arg Ala Leu Ser Lys Asp Gin Leu Gin Gin Leu Ala Asp 
20 25 30 

Glu val Arg Gly Tyr Leu Thr His Thr val ser lie Ser Gly Gly His 
35 40 45 

Phe Ala Ala Gly Leu Gly Thr val Glu Leu Thr val Ala Leu His Tyr 
50 55 60 

val Phe Asn Thr Pro val Asp Gin Leu Val Trp Asp val Gly His Gin 
65 70 75 80 

Ala Tyr Pro His Lys lie Leu Thr Gly Arg Lys Glu Arg Met Pro Thr 
85 90 95 

lie Arg Thr Leu Gly Gly Val Ser Ala Phe Pro Ala Arg Asp Glu Ser 
100 105 110 

Glu Tyr Asp Ala Phe Gly Val Gly His Ser ser Thr Ser lie Ser Ala 
115 120 125 

Ala Leu Gly Met Ala lie Ala ser Gin Leu Arg Gly Glu Asp Lys Lys 
130 135 140 

Met val Ala lie lie Gly Asp Gly Ser lie Thr Gly Gly Met Ala Tyr 
145 150 155 160 

Glu Ala Met Asn His Ala Gly Asp val Asn Ala Asn Leu Leu val lie 
165 170 175 

Leu Asn Asp Asn Asp Met Ser lie Ser Pro Pro val Gly Ala Met Asn 
180 185 190 

Asn Tyr Leu Thr Lys Val Leu Ser Ser Lys Phe Tyr Ser Ser val Arg 
195 200 205 

Glu Glu Ser Lys Lys Ala Leu Ala Lys Met Pro Ser val Trp Glu Leu 
210 215 220 

Ala Arg Lys Thr Glu Glu His Val Lys Gly Met lie val Pro Gly Thr 
225 230 235 240 

Leu Phe Glu Glu Leu Gly Phe Asn Tyr Phe Gly Pro lie Asp Gly His 
245 250 255 

Asp val Glu Met Leu val Ser Thr Leu Glu Asn Leu Lys Asp Leu Thr 
260 265 270 
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Gly Pro val Phe Leu His Val val Thr Lys Lys Gly Lys Gly Tyr Ala 
275 280 285 

Pro Ala Glu Lys Asp Pro Leu Ala Tyr His Gly val Pro Ala Phe Asp 
290 295 300 

Pro Thr Lys Asp Phe Leu Pro Lys Ala Ala Pro ser Pro His Pro Thr 
305 310 315 320 

Tyr Thr Glu val Phe Gly Arg Trp Leu Cys Asp Met Ala Ala Gin Asp 
325 330 335 

Glu Arg Leu Leu Gly lie Thr Pro Ala Met Arg Glu Gly Ser Gly Leu 
340 345 350 

Val Glu Phe ser Gin Lys Phe Pro Asn Arg Tyr Phe Asp val Ala lie 
355 360 365 

Ala Glu Gin His Ala val Thr Leu Ala Ala Gly Gin Ala Cys Gin Gly 
370 375 380 

Ala Lys Pro Val Val Ala lie Tyr Ser Thr Phe Leu Gin Arg Gly Tyr 
385 390 395 400 

Asp Gin Leu lie His Asp val Ala Leu Gin Asn Leu Asp Met Leu Phe 
405 410 415 

Ala Leu Asp Arg Ala Gly Leu val Gly Pro Asp Gly Pro Thr His Ala 
420 425 430 

Gly Ala Phe Asp Tyr Ser Tyr Met Arg Cys lie Pro Asn Met Leu lie 
435 440 445 

Met Ala Pro Ala Asp Glu Asn Glu Cys Arg Gin Met Leu Thr Thr Gly 
450 455 460 

Phe Gin His His Gly Pro Ala ser val Arg Tyr Pro Arg Gly Lys Gly 
465 470 475 480 

Pro Gly Ala Ala lie Asp Pro Thr Leu Thr Ala Leu Glu lie Gly Lys 
485 490 495 

Ala Glu Val Arg His His Gly Ser Arg lie Ala lie Leu Ala Trp Gly 
500 505 510 

ser Met val Thr Pro Ala val Glu Ala Gly Lys Gin Leu Gly Ala Thr 
515 520 525 

Val val Asn Met Arg Phe val Lys Pro Phe Asp Gin Ala Leu Val Leu 
530 535 540 



Page 21 



Glu Leu Ala Arg Thr His Asp val Phe val Thr val Glu Glu Asn val 
545 550 555 560 

lie Ala Gly Gly Ala Gly Ser Ala lie Asn Thr Phe Leu Gin Ala Gin 
565 570 575 

Lys val Leu Met Pro Val Cys Asn lie Gly Leu Pro Asp Arg Phe val 
580 585 590 

Glu Gin Gly ser Arg Glu Glu Leu Leu Ser Leu val Gly Leu Asp Ser 
595 600 605 

Lys Gly lie Phe Ala Thr lie Glu Gin Phe Cys Ala 
610 615 620 

<210> 15 

<211> 982 

<212> DNA 

<213> Methyl omonas 16a 
<220> 

<221> CDS 

<222> (22). .(975) 

<400> 15 

cccagtaaaa cactcaagaa t atg caa ate gta etc gea aae ecc egt gqa 51 

Met Gin lie Val Leu Ala Asn Pro Arg Gly 
15 10 

tte tgt gee gge gtg gae egg gee att gaa att gte gat caa gee ate 99 
Phe Cys Ala Gly Val Asp Arg Ala lie Glu lie val Asp Gin Ala lie 
15 20 25 

gaa gee ttt ggt geg eeg att tat gtg egg eae gag gtg gtg eat aae 147 
Glu Ala Phe Gly Ala Pro lie Tyr Val Arg His Glu Val Val His Asn 
30 35 40 

ege ace gtg gte gat gga etg aaa eaa aaa ggt geg gtg tte ate gag 195 
Arg Thr Val val Asp Gly Leu Lys Gin Lys Gly Ala val Phe lie Glu 
45 50 55 

gaa eta age gat gtg ccg gtg ggt tec tac ttg att tte age geg eae 243 
Glu Leu Ser Asp val Pro val Gly Ser Tyr Leu lie Phe Ser Ala His 
60 65 70 

gge gta tec aag gag gtg eaa eag gaa gee gag gag ege eag ttg acg 291 
Gly val ser Lys Glu val Gin Gin Glu Ala Glu Glu Arg Gin Leu Thr 
75 80 85 90 

gta tte gat geg act tgt eeg etg gtg ace aaa gtg eae atg eag gtt 339 
val Phe Asp Ala Thr Cys Pro Leu Val Thr Lys Val His Met Gin val 
95 100 105 

gee aag eat gee aaa eag gge cga gaa gtg att ttg ate gge eae gee 387 
Ala Lys His Ala Lys Gin Gly Arg Glu val lie Leu lie Gly His Ala 
110 115 120 



ggt eat eeg gaa gtg gaa gge aeg atg gge eag tat gaa aaa tge ace 
Gly His pro Glu val Glu Gly Thr Met Gly Gin Tyr Glu Lys Cys Thr 
125 130 135 

gaa gge gge gge att tat etg gte gaa act eeg gaa gae gta ege aat 
Glu Gly Gly Gly lie Tyr Leu Val Glu Thr Pro Glu Asp val Arg Asn 
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435 



483 



140 145 150 

ttg aaa gtc aac aat ccc aat gat ctg gcc tat gtq acg cag acg acc 531 
Leu Lys val Asn Asn Pro Asn Asp Leu Ala Tyr val Thr Gin Thr Thr 
155 160 165 170 

ttg teg atg acc gac acc aag gtc atg gtq gat gcg tta cgc gaa caa 579 
Leu sen Met Thr Asp Thr Lys val Met val Asp Ala Leu Arg Glu Gin 
175 180 185 

ttt ccg tec att aag gag caa aaa aag gac gat att tgt tac gcg acg 627 
Phe Pro Ser lie Lys Glu Gin Lys Lys Asp Asp lie Cys Tyr Ala Thr 
190 195 200 

caa aac cgt cag gat gcg gtq cat gat ctg gcc aag att tec gac ctg 675 
Gin Asn Arg Gin Asp Ala val His Asp Leu Ala Lys lie Ser Asp Leu 
205 210 215 

att ctg gtt gtc gqc tct ccc aat agt teg aat tee aac egt ttg egt 723 
lie Leu val val Gly Ser Pro Asn ser ser Asn ser Asn Arg Leu Arg 
220 225 230 

gaa ate gee gtq eaa etc gqt aaa eee get tat ttg ate gat act tac 771 
Glu lie Ala Val Gin Leu Gly Lys Pro Ala Tyr Leu lie Asp Thr Tyr 
235 240 245 250 

eag gat ttg aag eaa gat tgg etg gag gqa att gaa gta gtc ggg gtt 819 
Gin Asp Leu Lys Gin Asp Trp Leu Glu Gly lie Glu val Val Gly val 
255 260 265 

aee geg gqe get teg gcg ccg gaa gtq ttg gtq eag gaa gtq ate gat 867 
Thr Ala Gly Ala Ser Ala Pro Glu Val Leu val Gin Glu val lie Asp 
270 275 280 

eaa etg aag gea tgg gqe gqc gaa ace act teg gtc aga gaa aac age 915 
Gin Leu Lys Ala Trp Gly Gly Glu Thr Thr Ser Val Arg Glu Asn Ser 
285 290 295 

gqe ate gag gaa aag gta gtc ttt teg att ccc aag gag ttg aaa aaa 963 
Gly lie Glu Glu Lys val val Phe ser lie Pro Lys Glu Leu Lys Lys 
300 305 310 

cat atg caa geg tgatcaa 982 

His Met Gin Ala 

315 



<210> 16 
<211> 318 
<212> PRT 

<213> Methyl omonas 16a 
<400> 16 

Met Gin lie Val Leu Ala Asn Pro Arg Gly Phe Cys Ala Gly val Asp 
15 10 15 



Arg Ala lie Glu lie val Asp Gin Ala lie Glu Ala Phe Gly Ala Pro 
20 25 30 



lie Tyr val Arg His Glu val val His Asn Arg Thr Val Val Asp Gly 
35 40 45 



Leu Lys Gin Lys Gly Ala val Phe lie Glu Glu Leu ser Asp val Pro 
50 55 60 
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val Gly ser Tyr Leu lie Phe Ser Ala His Gly val Ser Lys Glu val 
65 70 75 80 

Gin Gin Glu Ala Glu Glu Arg Gin Leu Thr val Phe Asp Ala Thr Cys 
85 90 95 

Pro Leu val Thr Lys Val His Met Gin val Ala Lys His Ala Lys Gin 
100 105 110 

Gly Arg Glu val lie Leu lie Gly His Ala Gly His Pro Glu val Glu 
115 120 125 

Gly Thr Met Gly Gin Tyr Glu Lys Cys Thr Glu Gly Gly Gly lie Tyr 
130 135 140 

Leu val Glu Thr Pro Glu Asp val Arg Asn Leu Lys val Asn Asn Pro 
145 150 155 160 

Asn Asp Leu Ala Tyr val Thr Gin Thr Thr Leu ser Met Thr Asp Thr 
165 170 175 

Lys val Met Val Asp Ala Leu Arg Glu Gin Phe Pro ser lie Lys Glu 
180 185 190 

Gin Lys Lys Asp Asp lie Cys Tyr Ala Thr Gin Asn Arg Gin Asp Ala 
195 200 205 

val His Asp Leu Ala Lys lie Ser Asp Leu lie Leu val val Gly ser 
210 215 220 

Pro Asn ser Ser Asn ser Asn Arg Leu Arg Glu lie Ala val Gin Leu 
225 230 235 240 

Gly Lys Pro Ala Tyr Leu lie Asp Thr Tyr Gin Asp Leu Lys Gin Asp 
245 250 255 

Trp Leu Glu Gly lie Glu val val Gly val Thr Ala Gly Ala ser Ala 
260 265 270 

Pro Glu Val Leu val Gin Glu val lie Asp Gin Leu Lys Ala Trp Gly 
275 280 285 

Gly Glu Thr Thr Ser val Arg Glu Asn Ser Gly lie Glu Glu Lys val 
290 295 300 

val Phe Ser lie Pro Lys Glu Leu Lys Lys His Met Gin Ala 
305 310 315 

<210> 17 
<211> 1254 
<212> DNA 

<213> Methyl omonas 16a 
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<220> 

<221> CDS 

<222> (73). .(1254) 

<400> 17 

ggtggacagc atcattgcgg cggcaccgtt tttctatgcc ggtatcgtgc tgatcggacg 60 

gagcgtattc ga atg aaa ggt att tgc ata ttg ggc get acc gqt teg ate 111 
Met Lys Gly lie Cys lie Leu Gly Ala Thr Gly ser He 
15 10 

ggt gtc age aeg etg gat gtc gtt gee agg eat eeg gat aaa tat eaa 159 
Gly val Ser Thr Leu Asp val val Ala Arg His Pro Asp Lys Tyr Gin 
15 20 25 

gtc gtt geg etg aee gee aae gge aat ate gae gea ttg tat gaa eaa 207 
val Val Ala Leu Thr Ala Asn Gly Asn lie Asp Ala Leu Tyr Glu Gin 
30 35 40 45 

tge etg gee eac cat eeg gag tat geg gtg gtg gtc atg gaa age aag 255 
Cys Leu Ala His His Pro Glu Tyr Ala val val val Met Glu ser Lys 
50 55 60 

gta gca gag tte aaa eag ege att gee get teg eeg gta geg gat ate 303 
Val Ala Glu Phe Lys Gin Arg lie Ala Ala Ser Pro val Ala Asp He 
65 70 75 

aag gtc ttg teg gqt age gag gee ttg eaa eag gtg gee aeg etg gaa 351 
Lys Val Leu Ser Gly Ser Glu Ala Leu Gin Gin val Ala Thr Leu Glu 
80 85 90 

aae gte gat aeg gtg atg geg get ate gtc gge geg gee gga ttg ttg 399 
Asn val Asp Thr val Met Ala Ala lie Val Gly Ala Ala Gly Leu Leu 
95 100 105 

eeg aee ttg gee geg gee aag gee gge aaa aee gtg etg ttg gee aae 447 
Pro Thr Leu Ala Ala Ala Lys Ala Gly Lys Thr val Leu Leu Ala Asn 
110 115 120 125 

aag gaa gee ttg gtg atg teg gga eaa ate tte atg eag gee gte age 495 
Lys Glu Ala Leu val Met ser Gly Gin lie Phe Met Gin Ala val ser 
130 135 140 

gat tee gge get gtg ttg etg eeg ata gae age gag cae aae gee ate 543 
Asp ser Gly Ala val Leu Leu Pro lie Asp Ser Glu His Asn Ala lie 
145 150 155 

ttt eag tge atg eeg geg ggt tat aeg cea gge eat aea gee aaa eag 591 
Phe Gin Cys Met Pro Ala Gly Tyr Thr Pro Gly His Thr Ala Lys Gin 
160 165 170 

geg ege ege att tta ttg aee get tec ggt ggc cea ttt cga egg aeg 639 
Ala Arg Arg lie Leu Leu Thr Ala Ser Gly Gly Pro Phe Arg Arg Thr 
175 180 185 

eeg ata gaa aeg ttg tec age gte aeg eeg gat eag gee gtt gee eat 687 
Pro lie Glu Thr Leu Ser ser val Thr Pro Asp Gin Ala val Ala His 
190 195 200 205 

ect aaa tgg gae atg ggg ege aag att teg gtc gat tec gee acc atg 735 
Pro Lys Trp Asp Met Gly Arg Lys lie Ser Val Asp Ser Ala Thr Met 
210 215 220 

atg aae aaa ggt etc gaa etg ate gaa gee tgc ttg ttg tte aae atg 783 
Met Asn Lys Gly Leu Glu Leu lie Glu Ala Cys Leu Leu Phe Asn Met 
225 230 235 
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gag ccc gac cag att gaa gtc gtc att cat ccg cag age ate att cat 831 

Glu Pro Asp Gin lie Glu val val lie His Pro Gin ser lie He His 

240 245 250 

teg atg gtg gac tat gtc gat gqt teg gtt ttg gcg cag atg ggt aat 879 

Ser Met Val Asp Tyr Val Asp Gly Ser val Leu Ala Gin Met Gly Asn 
255 260 265 

ccc gac atg ege aeg ccg ata gcg cac gcg atg gee tgg ccg gaa egc 927 

Pro Asp Met Arg Thr Pro lie Ala His Ala Met Ala Trp Pro Glu Arg 
270 275 280 285 



ttt gac tet ggt gtg gcg ccg etg gat att ttc gaa gta ggg cac atg 
Phe Asp ser Gly val Ala Pro Leu Asp lie Phe Glu Val Gly His Met 
290 295 300 



975 



gat tte gaa aaa ccc gac ttg aaa egg ttt cet tgt etg aga ttg get 1023 
Asp Phe Glu Lys Pro Asp Leu Lys Arg Phe Pro Cys Leu Arg Leu Ala 
305 ' 310 315 

tat gaa gcc ate aag tet ggt gga att atg cea aeg gta ttg aae gca 1071 
Tyr Glu Ala lie Lys Ser Gly Gly lie Met Pro Thr val Leu Asn Ala 
320 325 330 

gee aat gaa att get gtc gaa gcg ttt tta aat gaa gaa gtc aaa tte 1119 
Ala Asn Glu lie Ala val Glu Ala Phe Leu Asn Glu Glu val Lys Phe 
335 340 345 

act gac ate gcg gtc ate ate gag egc age atg gee cag ttt aaa ccg 1167 
Thr Asp lie Ala Val lie He Glu Arg ser Met Ala Gin Phe Lys Pro 
350 355 360 365 

gac gat gcc ggc age etc gaa ttg gtt ttg cag gee gat caa gat gcg 1215 
Asp Asp Ala Gly ser Leu Glu Leu Val Leu Gin Ala Asp Gin Asp Ala 
370 375 380 

egc gag gtg get aga gac ate ate aag ace ttg gta get 1254 
Arg Glu val Ala Arg Asp lie lie Lys Thr Leu Val Ala 
385 390 

<210> 18 
<211> 394 
<212> PRT 

<213> Methyl omonas 16a 
<400> 18 

Met Lys Gly lie Cys He Leu Gly Ala Thr Gly Ser lie Gly val Ser 
15 10 15 

Thr Leu Asp val val Ala Arg His Pro Asp Lys Tyr Gin val val Ala 
20 25 30 

Leu Thr Ala Asn Gly Asn lie Asp Ala Leu Tyr Glu Gin Cys Leu Ala 
35 40 45 

His His Pro Glu Tyr Ala val Val Val Met Glu ser Lys val Ala Glu 
50 55 60 

Phe Lys Gin Arg lie Ala Ala ser Pro val Ala Asp lie Lys val Leu 
65 70 75 80 



ser Gly Ser Glu Ala Leu Gin Gin Val Ala Thr Leu Glu Asn val Asp 
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85 



90 



95 



Thr val Met Ala Ala lie val Gly Ala Ala Gly Leu Leu Pro Thr Leu 
100 105 110 

Ala Ala Ala Lys Ala Gly Lys Thr val Leu Leu Ala Asn Lys Glu Ala 
115 120 125 

Leu Val Met Ser Gly Gin lie Phe Met Gin Ala val ser Asp ser Gly 
130 135 140 

Ala val Leu Leu Pro lie Asp ser Glu His Asn Ala lie Phe Gin cys 
145 150 155 160 

Met Pro Ala Gly Tyr Thr Pro Gly His Thr Ala Lys Gin Ala Arg Arg 
165 170 175 

lie Leu Leu Thr Ala ser Gly Gly Pro Phe Arg Arg Thr Pro lie Glu 
180 185 190 

Thr Leu ser ser val Thr pro Asp Gin Ala val Ala His pro Lys Trp 
195 200 205 

Asp Met Gly Arg Lys lie Ser val Asp ser Ala Thr Met Met Asn Lys 
210 215 220 

Gly Leu Glu Leu lie Glu Ala cys Leu Leu Phe Asn Met Glu Pro Asp 
225 230 235 240 

Gin lie Glu val val lie His Pro Gin ser lie lie His ser Met val 
245 250 255 

Asp Tyr Val Asp Gly Ser val Leu Ala Gin Met Gly Asn Pro Asp Met 
260 265 270 

Arg Thr Pro lie Ala His Ala Met Ala Trp Pro Glu Arg Phe Asp Ser 

275 280 285 

Gly val Ala Pro Leu Asp lie Phe Glu Val Gly His Met Asp Phe Glu 
290 295 300 

Lys Pro Asp Leu Lys Arg Phe Pro Cys Leu Arg Leu Ala Tyr Glu Ala 
305 310 315 320 

lie Lys Ser Gly Gly lie Met Pro Thr val Leu Asn Ala Ala Asn Glu 
325 330 335 

lie Ala val Glu Ala Phe Leu Asn Glu Glu val Lys Phe Thr Asp lie 
340 345 350 



Ala Val lie lie Glu Arg ser Met Ala Gin Phe Lys Pro Asp Asp Ala 
355 360 365 
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Gly ser Leu Glu Leu val Leu Gin Ala Asp Gin Asp Ala Arg Glu val 
370 375 380 



Ala Arg Asp lie lie Lys Thr Leu val Ala 
385 390 

<210> 19 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #1 for amplification of crt gene cluster 

<400> 19 

atgacggtct gcgcaaaaaa acacg 25 

<210> 20 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #2 for amplification of crt gene cluster 

<400> 20 

gagaaattat gttgtggatt tggaatgc 28 

<210> 21 

<211> 61 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5'kanCdxs) 

<400> 21 

tggaagcgct agcggactac atcatccagc gtaataaata acgtcttgag cgattgtgta 60 
g 61 

<210> 22 

<211> 65 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 5'kan(idi) 

<400> 22 

tctgatgcgc aagctgaaga aaaatgagca tggagaataa tatgacgtct tgagcgattg 60 
tgtag 65 

<210> 23 

<211> 65 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5"kan(ygbBP) 
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<400> 23 

gacgcgtcga agcgcgcaca gtctgcgggg caaaacaatc gataacgtct tgagcgattg 60 
tgtag 65 

<210> 24 

<211> 60 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5'kan(ispAdxs) 

<400> 24 

accatgacgg ggcgaaaaat attgagagtc agacattcat gtgtaggctg gagctgcttc 60 

<210> 25 

<211> 64 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 3'kan 

<400> 25 

gaagacgaaa gggcctcgtg atacgcctat ttttataggt tatatgaata tcctccttag 60 
ttcc 64 

<210> 26 

<211> 50 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 5"-T5 

<400> 26 

ctaaggagga tattcatata acctataaaa ataggcgtat cacgaggccc 50 

<210> 27 
<211> 70 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 3'-T5(dxs) 
<400> 27 

ggagtcgacc agtgccaggg tcgggtattt ggcaatatca aaactcatag ttaatttctc 60 
ctctttaatg 70 

<210> 28 
<211> 68 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 3'-T5Cicli) 
<400> 28 

tgggaactcc ctgtgcattc aataaaatga cgtgttccgt ttgcatagtt aatttctcct 60 
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ctttaatg 



68 



<210> 29 
<211> 68 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 3'-T5(ygbBP) 
<400> 29 

cggccgccgg aaccacggcg caaacatcca aatgagtggt tgccatagtt aatttctcct 
ctttaatg 



60 
68 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



30 
62 
DNA 

Artificial sequence 
Primer 3' -T5(ispAdxs) 



<400> 30 

cctgcttaac gcaggcttcg agttgctgcg gaaagtccat agttaatttc tcctctttaa 60 
tg 62 



<210> 31 

<211> 65 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5'-kanT5(ispB) 

<400> 31 

accataaacc ctaagttgcc tttgttcaca gtaaggtaat cggggcgtct tgagcgattg 
tgtag 



60 
65 



<210> 32 
<211> 67 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 3 ' -kanT5Ci spB) 
<400> 32 

cgccatatct tgcgcggtta actcattgat tttttctaaa ttcatagtta atttctcctc 
tttaatg 



60 
67 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 

<400> 



33 

156 

DNA 

Artificial sequence 

Phage T5 promoter sequence 
33 
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ctataaaaat aggcgtatca cgaggccctt tcgtcttcac ctcgagaaat cataaaaaat 
ttatttgctt tgtgagcgga taacaattat aatagattca attgtgagcg gataacaatt 
tcacacagaa ttcattaaag aggagaaatt aactca 



<210> 34 

<211> 65 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5 ' -kanT5(dxsl6a) 

<400> 34 

cactaacgcc cgcacattgc tgcgggcttt 
tgtag 

<210> 35 

<211> 65 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 5 ' -kanT5Cdxrl6a) 

<400> 35 

taaagggcta agagtagtgt gctcttagcc 
tgtag 



<210> 36 

<211> 65 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 5 " -kanT5ClytBl6a) 

<400> 36 

ctacaactgg cgagatgcat agcgagtata 
tgtag 



<210> 37 

<211> 51 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 3 ' -kanT5Cclxsl6a) 

<400> 37 

agtagaggga agtctttgga aagagccata 



<210> 38 

<211> 51 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 3 ' -kanTSCdxrlSa) 



ttgattcatt tcgcacgtct tgagcgattg 



cttaattacg tttcccgtct tgagcgattg 



atttgtattt tgcgtcgtct tgagcgattg 



gttaatttct cctctttaat g 



Page 31 



<400> 38 

acggtgccgc cgcaatgatg ctgtccacca gttaatttct cctctttaat g 



51 



<210> 39 

<211> 51 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 3 ' -kanT5ClytBl6a) 

<400> 39 

ccacgggggt ttgcgagtac gatttgcata gttaatttct cctctttaat g 51 

<210> 40 

<211> 55 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5'-Cdxsl6a) 

<400> 40 

acagaattca ttaaagagga gaaattaact atggctcttt ccaaagactt ccctc 55 

<210> 41 

<211> 55 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 5'-Cdxrl6a) 

<400> 41 

acagaattca ttaaagagga gaaattaact ggtggacagc atcattgcgg cggca 55 

<210> 42 

<211> 55 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 5 ' -* ClytBl6a) 

<400> 42 

acagaattca ttaaagagga gaaattaact atgcaaatcg tactcgcaaa ccccc 55 

<210> 43 
<211> 68 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer 3'-Cdxsl6a) 
<400> 43 

aggagcgaag tgattatcag tatgctgttc atatagcctc gaattatcaa gcgcaaaact 60 
gttcgatg 68 

<210> 44 
<211> 67 
<212> DNA 
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<213> Artificial sequence 



<220> 

<223> Primer 3'-(dxrl6a) 



<400> 44 

ggcattttca ctctggcaat gcgcataaac gctttcaaag tcctgttaag ctaccaaggt 



60 



cttgatg 



67 



<210> 45 
<211> 68 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 3'-(lytBl6a) 
<400> 45 

agtggcggac gggcaaacaa gggtaacata ggatcaatga gggttattga tcacgcttgc 60 
atatgttt 68 



<210> 46 

<211> 2 5 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer T-kan 



<210> 47 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer B-ispA 

<400> 47 

cctaataatg cgccatactg catgg 25 



<210> 48 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer T-T5 



<210> 49 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer B-idi 



<400> 46 

accggatatc accacttatc tgctc 



25 



<400> 48 

taacctataa aaataggcgt atcacgaggc cc 



32 
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<400> 49 

tcatgctgac ctggtgaagg aatcc 



<210> 50 

<211> 25 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer B-dxs(16a) 

<400> 50 

gcgatattgt atgtctgatt cagga 



<210> 51 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer B-lytB(16a) 

<400> 51 

tccactggat gcgggaagct ggcag 



<210> 52 

<211> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer B-dxs 

<400> 52 

tggcaacagt cgtagctcct gggtgg 



<210> 53 

<211> 25 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer B-ygb 

<400> 53 

ccagcagcgc atgcaccgag tgttc 



<210> 54 

<211> 21 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer Tn5PCRF 

<400> 54 

gctgagttga aggatcagat c 



<210> 55 

<211> 21 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Primer TnSPCRR 



<400> 55 

cgagcaagac gtttcccgtt g 



21 



<210> 56 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer Kan-2 FP-1 

<400> 56 

acctacaaca aagctctcat caacc 25 



<210> 57 

<211> 25 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer Kan-2 RP-1 



<210> 58 

<211> 20 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Primer Y15_F 

<400> 58 

ggatcgatct tgagatgacc 20 



<210> 59 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer Y15_R 



<210> 60 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer T-Tn5yjeR 

<400> 60 

gcaatgtaac atcagagatt ttgag 25 



<400> 57 

gcaatgtaac atcagagatt ttgag 



25 



<400> 59 

gctttcgtaa ttttcgcatt tctg 



24 



<210> 61 
<211> 24 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer B-yjeR 
<400> 61 

gctttcgtaa ttttcgcatt tctg 24 



<210> 62 

<211> 26 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer B-ispB 

<400> 62 

agtacagcaa tcatcggacg aatacg 26 

<210> 63 

<211> 1845 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Sequence yjeR: :Tn5 mutant gene (transposon disrupted yjeR) 



<400> 63 
atgggcaaaa 


catctatgat 


acacgcaatt 


gtggatcaat 


atagtcactg 


tgaatgggtg 


60 


gaaaatagca 


tgagtgccaa 


tgaaaacaac 


ctgatttgga 


tcgatcttga 


gatgaccggt 


120 


ctggatcccg 


agcgcgatcg 


cattattgag 


attgccacgc 


tggtgaccga 


tgccaacctg 


180 


aatattctgg 


cagaagggcc 


gaccattgca 


gtacaccagt 


ctgatgaaca 


gctggcgctg 


240 


atggatgact 


ggaacgtgcg 


cacccatacc 


gccagcgggc 


tggtagagcg 


cgtgaaagcg 


300 


agcacgatgg 


gcgatcggga 


agctgaactg 


gcaacgctcg 


aatttttaaa 


acagtgggtg 


360 


cctgcgggaa 


aatcgccgat 


ttgcggtaac 


agcatcggtc 


aggaccgtcg 


tttcctgttt 


420 


aaatacatgc 


cggagctgga 


agcctacttc 


cactaccgtt 


atctcgatgt 


cagcaccctg 


480 


aaagagctgg 


cgcgccgctg 


gaagccggaa 


attctggatg 


gttttaccaa 


gcaggggacg 


540 


catcaggcga 


tggatgatat 


ccgtgaatcg 


gtggcggagc 


tggcttacta 


cctgtctctt 


600 


atacacatct 


caaccctgaa 


gcttgcatgc 


ctgcaggtcg 


actctagagg 


atccccgcca 


660 


cggttgatga 


gagctttgtt 


gtaggtggac 


cagttggtga 


ttttgaactt 


ttgctttgcc 


720 


acggaacggt 


ctgcgttgtc 


gggaagatgc 


gtgatctgat 


ccttcaactc 


agcaaaagtt 


780 


cgatttattc 


aacaaagccg 


ccgtcccgtc 


aagtcagcgt 


aatgctctgc 


cagtgttaca 


840 


accaattaac 


caattctgat 


tagaaaaact 


catcgagcat 


caaatgaaac 


tgcaatttat 


900 


tcatatcagg 


attatcaata 


ccatattttt 


gaaaaagccg 


tttctgtaat 


gaaggagaaa 


960 


actcaccgag 


gcagttccat 


aggatggcaa 


gatcctggta 


tcggtctgcg 


attccgactc 


1020 


gtccaacatc 


aatacaacct 


attaatttcc 


cctcgtcaaa 


aataaggtta 


tcaagtgaga 


1080 


aatcaccatg 


agtgacgact 


gaatccggtg 


agaatggcaa 


aagtttatgc 


atttctttcc 


1140 
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dydC L lQ L lC 


aacay gccag 


ccaxxacgcx 


cgxcaxcaaa 


axcacxcgca 


xcaaccaaac 


±ZUU 


/-/-i-t--f- ■:j-t--t-r--3-t- 
Cy tXd. L LCd. L 


Lcg tya L Lgc 


yccxgagcga 


yacyaaaxac 


gcgaxcgcxg 


xxaaaaggac 






ciygaa Lcy aa 


xgcaaccyy c 


9 cay 9^^^^^ 


Xyccagcgca 


xcaacaaxax 


jldZ.\j 


LuLCd.CCX.gcl 


3 f~ -3 /-« o "t~ ^ "t~ 

d. L Cay y a. L aX. 


"f- /" 1- "h i- a a -t- a 

xcxxcxaaxa 


ccxgyaaxgc 


xgxxxxxccg 


gggaxcgcag 


±3<SU 


-♦- /~i /"I "t" n ^ /I "t" 3 a 

Lyy xyay Ld.<i 


CCd Xy Ca LCa 


xcayyayxac 


ygaxaaaaxg 


cxxgaxggxc 


ggaagaggca 


1 /I >! n 


^ «5 "a /~ *~ ^ 

LaaaL LCCg X 


^ ^ « ^ ^ +■ '4~ 

cagccagxxx 


agxcxgacca 


xcxcaxcxgx 


M| M <Aw JIM M| «§W iAs 

aacaxcaxxg 


gcaacgcxac 


1 C f\f\ 


ct Lxgccaxy 


xxxcagaaac 


aacxcxggcy 


caxcgggcxx 


cccatiacaat: 


cgatagalitig 


1 C Cif\ 

IdoO 


Lcgcdcc xga. 


xxgcccgaca 


xxaxcgcgay 


cccaxxxaxa 


cccatiataaa 


tcagcatcca 




tgttggaatt 


taatcgcggc 


ctcgagcaag 


acgtttcccg 


ttgaatatgg 


ctcataacac 


1680 


cccttgtatt 


actgtttatg 


taagcagaca 


gttttattgt 


tcatgatgat 


atatttttat 


1740 


cttgtgcaat 


gtaacatcag 


agattttgag 


acacaattca 


tcgatgatgg 


ttgagatgtg 


1800 


tataagagac 


aggcttacta 


ccgcgagcat 


tttatcaagc 


tgtaa 




1845 


<210> 64 
<211> 8609 
<212> DNA 

<213> Artificial sequence 










<220> 

<223> Plasmid pPCBlS 












<400> 64 
cgtatggcaa 


tgaaagacgg 


tgagctggtg 


axaxgggaxa 


gxgxxcaccc 


xxgxxacacc 


DU 


gttttccatg 


agcaaactga 


aacgttttca 


xcgcxcxgga 


gxgaaxacca 


cgacgaxxxc 


±ZU 


cggcagtttc 


tacacatata 


ttcgcaagat 


gxyycgxgxx 


acggxgaaaa 


ccxggccxax 


1 Qf\ 

±oU 


ttccctaaag 


ggtttattga 


gaatatgttt 


X xcg xcxcag 


ccaaxcccxg 


ggxgagtttc 


o >i n 


accagttttg 


atttaaacgt 


ggccaatatg 


gacaacttct 


tcgcccccgt 


tttcaccatg 


300 


ggcaaatatt 


atacgcaagg 


cgacaaggtg 


cxgaxgccgc 


xggcgaxxca 


ggttcatcat 




gccgtctgtg 


atggcttcca 


tgtcggcaga 


axgcxxaa xg 


aattacaaca 


gtactgcgat: 


4zU 


gagtggcagg 


gcggggcgta 


atttttttaa 


ggcagttatt 


ggtgcctaga 


aaLaxtxtax 


480 


ctgattaata 


agatgatctt 


cttgagatcg 


xxxxggtc xg 


cgcgtaatct 


cx xgcxcxga 


540 


aaacgaaaaa 


accgccttgc 


agggcggttt 


t'tcgaagg'tt: 


ctctgagcta 


ccaacxcxxx 


CAA 


gaaccgaggt 


aactggcttg 


gaggagcgca 


gticaccaaaa 


cxtgxccxxx 


cagirttagcc 


bbU 


ttaaccggcg 


catgacttca 


agactaactc 


cxcxaaaxca 


axxaccagxg 


gcxgcxgcca 


/ 


gtggtgcttt 


tgcatgtctt 


tccgggttgg 


acxcaagacg 


axagxxaccg 


gaxaaggcgc 


7Q A 


agcggtcgga 


ctgaacgggg 


ggttcgtgca 


xacagxccag 


cxxggagcga 


acxgccxacc 


C y1 A 

o4U 


cggaactgag 


tgtcaggcgt 


ggaatgagac 


aaacgcggcc 


ataacagcgg 


aatgacaccg 


900 


gtaaaccgaa 


aggcaggaac 


aggagagcgc 


acgagggagc 


cgccagggga 


aacgcctggt 


960 


atctttatag 


tcctgtcggg 


tttcgccacc 


actgatttga 


gcgtcagatt 


tcgtgatgct 


1020 


tgtcaggggg 


gcggagccta 


tggaaaaacg 


gctttgccgc ggccctctca 
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cttccctgtt 


1080 



aagtatcttc 


ctggcatctt 


ccaggaaatc 


tccgccccgt 


tcgtaagcca 


tttccgctcg 


1140 


ccgcagtcga 


acgaccgagc 


gtagcgagtc 


agtgagcgag 


gaagcggaat 


atatcctgta 


1200 


tcacatattc 


tgctgacgca 


ccggtgcagc 


cttttttctc 


ctgccacatg 


aagcacttca 


1260 


ctgacaccct 


catcagtgcc 


aacatagtaa 


gccagtatat 


acactccgct 


agcgcccaat 


1320 


acgcaaaccg 


cctctccccg 


cgcgttggcc 


gattcattaa 


tgcagctggc 


acgacaggtt 


1380 


tcccgactgg 


aaagcgggca 


gtgagcgcaa 


cgcaattaat 


gtgagttagc 


tcactcatta 


1440 


ggcaccccag 


gctttacact 


ttatgcttcc 


ggctcgtatg 


ttgtgtggaa 


ttgtgagcgg 


1500 


ataacaattt 


cacacaggaa 


acagctatga 


ccatgattac 


gaattcgagc 


tcggtaccca 


1560 


aacgaattcg 


cccttttgac 


ggtctgcgca 


aaaaaacacg 


ttcaccttac 


tggcatttcg 


1620 


gctgagcagt 


tgctggctga 


tatcgatagc 


cgccttgatc 


agttactgcc 


ggttcagggt 


1680 


gagcgggatt 


gtgtgggtgc 


cgcgatgcgt 


gaaggcacgc 


tggcaccggg 


caaacgtatt 


1740 


cgtccgatgc 


tgctgttatt 


aacagcgcgc 


gatcttggct 


gtgcgatcag 


tcacggggga 


1800 


ttactggatt 


tagcctgcgc 


ggttgaaatg 


gtgcatgctg 


cctcgctgat 


tctggatgat 


1860 


atgccctgca 


tggacgatgc 


gcagatgcgt 


cgggggcgtc 


ccaccattca 


cacgcagtac 


1920 


ggtgaacatg 


tggcgattct 


ggcggcggtc 


gctttactca 


gcaaagcgtt 


tggggtgatt 


1980 


gccgaggctg 


aaggtctgac 


gccgatagcc 


aaaactcgcg 


cggtgtcgga 


gctgtccact 


2040 


gcgattggca 


tgcagggtct 


ggttcagggc 


cagtttaagg 


acctctcgga 


aggcgataaa 


2100 


ccccgcagcg 


ccgatgccat 


actgctaacc 


aatcagttta 


aaaccagcac 


gctgttttgc 


2160 


gcgtcaacgc 


aaatggcgtc 


cattgcggcc 


aacgcgtcct 


gcgaagcgcg 


tgagaacctg 


2220 


catcgtttct 


cgctcgatct 


cggccaggcc 


tttcagttgc 


ttgacgatct 


taccgatggc 


2280 


atgaccgata 


ccggcaaaga 


catcaatcag 


gatgcaggta 


aatcaacgct 


ggtcaattta 


2340 


ttaggctcag 


gcgcggtcga 


agaacgcctg 


cgacagcatt 


tgcgcctggc 


cagtgaacac 


2400 


ctttccgcgg 


catgccaaaa 


cggccattcc 


accacccaac 


tttttattca 


ggcctggttt 


2460 


gacaaaaaac 


tcgctgccgt 


cagttaagga 


tgctgcatga 


gccattttgc 


ggtgatcgca 


2520 


ccgccctttt 


tcagccatgt 


tcgcgctctg 


caaaaccttg 


ctcaggaatt 


agtggcccgc 


2580 


ggtcatcgtg 


ttacgttttt 


tcagcaacat 


gactgcaaag 


cgctggtaac 


gggcagcgat 


2640 


atcggattcc 


agaccgtcgg 


actgcaaacg 


catcctcccg 


gttccttatc 


gcacctgctg 


2700 


cacctggccg 


cgcacccact 


cggaccctcg 


atgttacgac 


tgatcaatga 


aatggcacgt 


2760 


accagcgata 


tgctttgccg 


ggaactgccc 


gccgcttttc 


atgcgttgca 


gatagagggc 


2820 


gtgatcgttg 


atcaaatgga 


gccggcaggt 


gcagtagtcg 


cagaagcgtc 


aggtctgccg 


2880 


tttgtttcgg 


tggcctgcgc 


gctgccgctc 


aaccgcgaac 


cgggtttgcc 


tctggcggtg 


2940 


atgcctttcg 


agtacggcac 


cagcgatgcg 


gctcgggaac 


gctataccac 


cagcgaaaaa 


3000 


atttatgact 


ggctgatgcg 


acgtcacgat 


cgtgtgatcg 


cgcatcatgc 


atgcagaatg 


3060 


ggtttagccc 


cgcgtgaaaa 


actgcatcat 


tgtttttctc 


cactggcaca 


aatcagccag 


3120 
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L Lgd. xccccg 


dcixxgga.xxx 


'^'fcfffyf'^ a a 
XXXXXyXddd 


yxy X Lyxxdy 


dXLyXLLLi-d 


xyxyy xxyyd 


J JLOU 




ddc(.(.i.dyyy 


gacgccgggg 


'1~r"a'fT*aa^"l""l~ 
Ll-dLXddv.L L 


r- 1" 1- a 1- i" 1- 1- 1" r* 

XLLdLLLLL.L. 


y LL.\-i.L.yydX 


^740 




LLLLLyd.Ll- 


yxLyyyxdxx 


X Ly xdy y ydx 


dLxy L Ld Lyy 


Ly L LXdyy 




a C. L. a L v.y I. V. d 


ddyu.v. Lyi.yd 


dydyy Lyyd l 


nr"nr"an^^ar" 
yxyxdy l Ldc 


Ly L Lyyi.ciL.a 


X Ly Lyyxyy X 




CXC LC.dy Cv.d 


(.ycdyycdyy 


Lyddx Lyy XX 


cggggxgggg 


dXd L LXdyy l 


Ly Lyydxxxx 




yv.CydUi.ddl. 


uuyi.dycdu L 


•♦-■t"r"ar"annr"a 
L LXdXdyy Xd 


^ -» /Tt-"t"n a r* a a 
Xdy L LydXdd 


LXdXdXd Lyy 


+^rinoa+"naa^ 

xyyydxyadx 




dcyy Ldcxyy 


dcyc.xdxxyc 


X XXXXyXdXd 


xxyxxdxxyy 


xy X Ly xxy X l 


yy xdxxxydx 


J jf u 


Cddccuyycy 


xyy cd LWiduy 


ddXXy L L Ld L 


xdxyy xdxxy 


y xddy xy xyx 


yxxxxyyxLx 


J ouu 


dc Ldv.v.dyc.c. 


dxycycxyyu 


yxyyxdydxx 


xyd Lxy xxyx 


^na^+'aa^a^ 

xydxxddxdx 


xydxxdxxxy 


3 00U 


Cdy cy udxyd 


^aaaaa+^^a 
C.dddddXXv.d 


yyxxyxdxxy 


xyxxxyyxdy 


yxyyxdxdxx 


dyxxy xxy XX 




yd Ldx. xyx xy 


a a ^ n tt r' n ■3 +■ 

ddCdygcgdX 


^ ft ft ^ ^ ^ +" +■ 
gcggdxcxgx 


^a^i^^a^i^a^ 

xdgxxdgxdx 


xxdy xyggxd 


ggdL xdxgcd 


3 / oU 


dCCyCdCXdX 


gdXCXCdXXC 


xggxcgg xgc 


cggxx xggcx 


aaxggcc xxa 


xcgcgcxxcg 




y cx L\.dy cdd 


Cdycdxccyy 


a^a^rt^Aina^ 

dXdxyxyydx 


xxxyxxxdxx 


gaggcgggrc 


^ ^ ^ /"i ft f^ ft ft ft 
cxgaggcggg 




dyyyddCCdX 


dccxygxccx 


^^^a^/ia a/ia 
XXXdXyddyd 


ggdxxxddcg 


XXgddXXdgx 


dxxgx xggdx 




dy cyccycxx 


gxggxccaxc 


dxxgy xxxyd 


cxdccdggxx 


^ ft ■+* ^ f^ f" f ^ 

xgxxxcxxxx 


ddxgxcgxxg 


'\\JC\j 


ccdxyxyddw. 


dyxyy cxdcx 


dxxyxyxydx 


xxxxxyy xdx 


xxxyxxyyyd 


Xdxxxxggxd 




dUdy L L Lyy d 


/~aa*~a^^^a^ 
i.ddCd X L Ld L 


yy X xy Xd Ldx 


xyxyy l l LXd 


yxxy L LXd xy 


xxyddxxyyx 




ccdyxxdycy 


ft 3 r\ r% f ^ ft fy a 

ydxyyccyya 


XXdXXXdXyx 


^a£i^a^a#^^rt 

Xdy LdXdgxg 


a^^/ia^rt/ia^ 

dxcydxygdx 


ft ft ft ft ^ ^ A" ^ ^ 
gggg X xdxdx 




y ccxydxxcx 


y CdCXdcycy 


■l"artna"t"'f"/"/*a 

Xdygdxxxxd 


rt^^a++^ a^^ 

yyxdXXXdXx 


ft ft ^ ^ ^ ft 0 ^ 
ggxcaggagx 


^/i^aa/~^rta^ 

ggxddx xgag 




cy cy ccy Cdx 


yyxxxdxxyx 


^a^^/ia^^a^ 

XdXXydXXdX 


xdxyydxy xy 


dxggxxgdxx 


art^aa a ■3 ft 

dgxddddxgg 




U Ld\-v.y U L L L. 


y L I. Ld cdv.U.L. 


LyXXyL.LLL(_ 


uy xddxxy Xd 


XLyxLydLxy 


aana^a^a^a 
ddy dXdXdXd 




V. Ldv.dL Uydv. 


aa/^/*i^'t"aa^^ 
ddy y X Ldd LX 


L LXdyyxxyd 


dxyyyxyxy l 


^a/iaa/"a^^/~ 
XdyddXdL LX 


yxydLLdLyx 




Ly (.y v-y dL.dy 


yy L Lyyxxy l 


LdXdydXy L L 


yxLyxyyydd 


oa a /~ 3 /^ /^ /~i "t" n 

yddxdyyy Ly 


XdL LyXXXd L 




LaCyXXad\-y 


ggcgaXddxx 


y xxdy L L L Ly 


*^/~aa/**a*^^aa 
yXddXdyXdd 


f~ f~ f~\ f f~ f~ 
xcgxddycxx 


gxdgxggax L 


^ J DU 


dcgcgccggg 


xxgxxxxdxx 


xydxddxxyy 


XXdXXXXXXd 


ccgcxxgcgg 


-t- /~ /~ "t" n /~ 

xggxgx xggc 


Aft')(\ 


cgdxcgxcxc 


dgcgcgcxgg 


dxgxgxxxdc 


XXCX LXClCX 


g xxcdCxaga 


cga L Lgcxcd 


H-DOU 


cxxxgcccag 


Cddcgxxggc 


agcddCdggg 


gx XX L xxxgc 


dxgcxgddxc 


ft f~ ^ "f- /-» +• +* ft +■ 
gxdxgxxg XX 


A7 AO 


xxxcigccggci 


ccggccgdgx 


xdxgxxyy xy 


xgxgaxgxdg 


XyXLLXLdLy 


yx L LdXXXyd 


AROn 
HOL/U 


yyd L Lxgd lx 


yxxxyxLxxx 


d Ly xy yy add 


dXLXdxxyxy 


dxxydLxyyx 


LdXyXdLLXL 


*tO OLI 


ydycyy cddy 


xxy xxxyxxx 


xxy L L L Lxyx 


yy xdxxy Xdy 


nr"a a't~1"a1"na 
yXddL LdLyd 


xy dx LXd Lxy 




^+"/^a arta^^rt 

xxyddydycy 


a^ + a^a+"rtaa 

dCXdCdXydd 


a^*~aa^^a^fi 

dxxddxxdxy 


gxddxxyyxy 


xgggxxxxyy 


xygxxxggxd 


*+!70LI 


ctggcaattc 


gtttacaggc 


cgcaggtatt 


cctgttttgc 


tgcttgagca 


gcgcgacaag 


5040 


ccgggtggcc 


gggcttatgt 


ttatcaggag 


cagggcttta 


cttttgatgc 


aggccctacc 


5100 


gttatcaccg 


atcccagcgc 


gattgaagaa 


ctgtttgctc 


tggccggtaa 


acagcttaag 


5160 


gattacgtcg 


agctgttgcc 


ggtcacgccg 


ttttatcgcc 


tgtgctggga 


gtccggcaag 


5220 
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gtcttcaatt 


acgataacga 


ccaggcccag 


ttagaagcgc 


agatacagca 


gtttaatccg 


5280 


cgcgatgttg 


cgggttatcg 


agcgttcctt 


gactattcgc 


gtgccgtatt 


caatgagggc 


5340 


tatctgaagc 


tcggcactgt 


gcctttttta 


tcgttcaaag 


acatgcttcg 


ggccgcgccc 


5400 


cagttggcaa 


agctgcaggc 


atggcgcagc 


gtttacagta 


aagttgccgg 


ctacattgag 


5460 


gatgagcatc 


ttcggcaggc 


gttttctttt 


cactcgctct 


tagtgggggg 


gaatccgttt 


5520 


gcaacctcgt 


ccatttatac 


gctgattcac 


gcgttagaac 


gggaatgggg 


cgtctggttt 


5580 


ccacgcggtg 


gaaccggtgc 


gctggtcaat 


ggcatgatca 


agctgtttca 


ggatctgggc 


5640 


ggcgaagtcg 


tgcttaacgc 


ccgggtcagt 


catatggaaa 


ccgttgggga 


caagattcag 


5700 


gccgtgcagt 


tggaagacgg 


cagacggttt 


gaaacctgcg 


cggtggcgtc 


gaacgctgat 


5760 


gttgtacata 


cctatcgcga 


tctgctgtct 


cagcatcccg 


cagccgctaa 


gcaggcgaaa 


5820 


aaactgcaat 


ccaagcgtat 


gagtaactca 


ctgtttgtac 


tctattttgg 


tctcaaccat 


5880 


catcacgatc 


aactcgccca 


tcataccgtc 


tgttttgggc 


cacgctaccg 


tgaactgatt 


5940 


cacgaaattt 


ttaaccatga 


tggtctggct 


gaggattttt 


cgctttattt 


acacgcacct 


6000 


tgtgtcacgg 


atccgtcact 


ggcaccggaa 


gggtgcggca 


gctattatgt 


gctggcgcct 


6060 


gttccacact 


taggcacggc 


gaacctcgac 


tgggcggtag 


aaggaccccg 


actgcgcgat 


6120 


cgtatttttg 


actaccttga 


gcaacattac 


atgcctggct 


tgcgaagcca 


gttggtgacg 


6180 


caccgtatgt 


ttacgccgtt 


cgatttccgc 


gacgagctca 


atgcctggca 


aggttcggcc 


6240 


ttctcggttg 


aacctattct 


gacccagagc 


gcctggttcc 


gaccacataa 


ccgcgataag 


6300 


cacattgata 


atctttatct 


ggttggcgca 


ggcacccatc 


ctggcgcggg 


cattcccggc 


6360 


gtaatcggct 


cggcgaaggc 


gacggcaggc 


ttaatgctgg 


aggacctgat 


ttgacgaata 


6420 


cgtcattact 


gaatcatgcc 


gtcgaaacca 


tggcggttgg 


ctcgaaaagc 


tttgcgactg 


6480 


catcgacgct 


tttcgacgcc 


aaaacccgtc 


gcagcgtgct 


gatgctttac 


gcatggtgcc 


6540 


gccactgcga 


cgacgtcatt 


gacgatcaaa 


cactgggctt 


tcatgccgac 


cagccctctt 


6600 


cgcagatgcc 


tgagcagcgc 


ctgcagcagc 


ttgaaatgaa 


aacgcgtcag 


gcctacgccg 


6660 


gttcgcaaat 


gcacgagccc 


gcttttgccg 


cgtttcagga 


ggtcgcgatg 


gcgcatgata 


6720 


tcgctcccgc 


ctacgcgttc 


gaccatctgg 


aaggttttgc 


catggatgtg 


cgcgaaacgc 


6780 


gctacctgac 


actggacgat 


acgctgcgtt 


attgctatca 


cgtcgccggt 


gttgtgggcc 


6840 


tgatgatggc 


gcaaattatg 


ggcgttcgcg 


ataacgccac 


gctcgatcgc 


gcctgcgatc 


6900 


tcgggctggc 


tttccagttg 


accaacattg 


cgcgtgatat 


tgtcgacgat 


gctcaggtgg 


6960 


gccgctgtta 


tctgcctgaa 


agctggctgg 


aagaggaagg 


actgacgaaa 


gcgaattatg 


7020 


ctgcgccaga 


aaaccggcag 


gccttaagcc 


gtatcgccgg 


gcgactggta 


cgggaagcgg 


7080 


aaccctatta 


cgtatcatca 


atggccggtc 


tggcacaatt 


acccttacgc 


tcggcctggg 


7140 


ccatcgcgac 


agcgaagcag 


gtgtaccgta 


aaattggcgt 


gaaagttgaa 


caggccggta 


7200 


agcaggcctg 


ggatcatcgc 


cagtccacgt 


ccaccgccga 


aaaattaacg 


cttttgctga 


7260 
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cggcatccgg 


tcaggcagtt 


acttcccgga 


tigaagacgta 


tccaccccgt 


cctgcticatc 


/ OilU 


"tctggcagcg 


cccgatic'tag 


ccgcatgcct 


"ttctctcagc 


gtcgcctgaa 


gtttagataa 


/ 30U 


cggtggcgcg 


tacagaaaac 


caaaggacac 


gcagccctct 


t t tcccctta 


cagcatgatg 


/440 


ca.LSicgg lqq 


gcca'tg'ta'ta 


accgt't'tcag 


gtagcct't'tg 


cgcgg'taligt: 


agcggaacgg 




ccagcgctgg 


LgLaccaguc 


eg Lcgrggac 


ca'taaaatac 


agtiaaaccat 


aagcggtcat 


7 ^an 


gcctgcacca 


atccactgga 


gcggccagat 


tcctgtactg 


ccgaagtaaa 


tcagggcaat 


7CO A 


cgacacaaug 


gcgaatiacca 


cggcanagag 


aucg LLaacTi 


"tcaaaligcgc 


ctttacgcgg 


/boO 


Lxcargarg L 


gaaagaxgcc 


agccccaacc 


ccagccg rgc 


argargTaxt 


tatgtgccag 


77>l O 

77h{J 


iigcagcaacc 


acuLCcargc 


cgaccacggr 


gacaaacacg 


atcagggcat 


tccaaatcca 


70/\ A 

7800 


caacataa't't 


tctcaagggc 


gaat'tcgcgg 


ggatcctcta 


gagtcgacct 


gcaggcatgc 


•70 C A 

7od0 


aagcttggca 


ctggccgtcg 


ttttacaacg 


tcgtgactgg 


gaaaaccctg 


gcgttaccca 


7 AT A 

79Z0 


acttaatcgc 


cttgcagcac 


atcccccttt 


cgccagctgg 


cgtaatagcg 


aagaggcccg 


7980 


caccgatcgc 


ccttcccaac 


agttgcgcag 


cctgaatggc 


gaatggcgct 


gatgtccggc 


8040 


ggtgcttttg 


ccg'ttiacgca 


ccaccccgtc 


ag'tagctgaa 


caggagggac 


agctgataga 


0 1 A A 

8100 


aacagaagcc 


actggagcac 


ctcaaaaaca 


ccatcataca 


ctaaatcagt 


aagttggcag 


8160 


calicacccga 


cgcaclililigc 


gccgaa'taaa 


tiacctgtigac 


ggaaga'tcac 


litcgcagaat 


8^^0 


aaataaatcc 


tggtgtccct 


gttgataccg 


ggaagccctg 


ggccaacttt 


tggcgaaaat 


0 1 OA 

8280 


gagacgttga 


tcggcacgta 


agaggttcca 


actttcacca 


taatgaaata 


agatcactac 


0 D >l A 

8340 


cgggcgLaTi L 


LL Lxgagt xa 


xcgagattLt 


caggagctaa 


ggaagctaaa 


atggagaaaa 


0 A A A 

8400 


aaatcactgg 


atataccacc 


gttgatatat 


cccaatggca 


tcgtaaagaa 


cattttgagg 


8460 


catttcagtc 


agttgctcaa 


tgtacctata 


accagaccgt 


tcagctggat 


attacggcct 


8520 


ttttaaagac 


cgtaaagaaa 


aataagcaca 


agttttatcc 


ggcctttatt 


cacattcttg 


8580 


cccgcctgat 


gaatgctcat 


ccggaattt 








8609 



<210> 65 
<211> 6329 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Plasmid pKD46 

<400> 65 

catcgattta ttatgacaac ttgacggcta catcattcac tttttcttca caaccggcac 60 

ggaactcgct cgggctggcc ccggtgcatt ttttaaatac ccgcgagaaa tagagttgat 120 

cgtcaaaacc aacattgcga ccgacggtgg cgataggcat ccgggtggtg ctcaaaagca 180 

gcttcgcctg gctgatacgt tggtcctcgc gccagcttaa gacgctaatc cctaactgct 240 

ggcggaaaag atgtgacaga cgcgacggcg acaagcaaac atgctgtgcg acgctggcga 300 

tatcaaaatt gctgtctgcc aggtgatcgc tgatgtactg acaagcctcg cgtacccgat 360 

tatccatcgg tggatggagc gactcgttaa tcgcttccat gcgccgcagt aacaattgct 420 
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caagcagatt 


tatcgccagc 


agctccgaat 


agcgcccttc 


cccttgcccg 


gcgttaatga 


480 


tttgcccaaa 


caggtcgctg 


aaatgcggct 


ggtgcgcttc 


atccgggcga 


aagaaccccg 


540 


tattggcaaa 


tattgacggc 


cagttaagcc 


attcatgcca 


gtaggcgcgc 


ggacgaaagt 


600 


aaacccactg 


gtgataccat 


tcgcgagcct 


ccggatgacg 


accgtagtga 


tgaatctctc 


660 


ctggcgggaa 


cagcaaaata 


tcacccggtc 


ggcaaacaaa 


ttctcgtccc 


tgatttttca 


720 


ccaccccctg 


accgcgaatg 


gtgagattga 


gaatataacc 


tttcattccc 


agcggtcggt 


780 


cgataaaaaa 


atcgagataa 


ccgttggcct 


caatcggcgt 


taaacccgcc 


accagatggg 


840 


cattaaacga 


gtatcccggc 


agcaggggat 


cattttgcgc 


ttcagccata 


cttttcatac 


900 


tcccgccatt 


cagagaagaa 


accaattgtc 


catattgcat 


cagacattgc 


cgtcactgcg 


960 


tcttttactg 


gctcttctcg 


ctaaccaaac 


cggtaacccc 


gcttattaaa 


agcattctgt 


1020 


aacaaagcgg 


gaccaaagcc 


atgacaaaaa 


cgcgtaacaa 


aagtgtctat 


aatcacggca 


1080 


gaaaagtcca 


cattgattat 


ttgcacggcg 


tcacactttg 


ctatgccata 


gcatttttat 


1140 


ccataagatt 


agcggatcct 


acctgacgct 


ttttatcgca 


actctctact 


gtttctccat 


1200 


acccgttttt 


ttgggaattc 


gagctctaag 


gaggttataa 


aaaatggata 


ttaatactga 


1260 


aactgagatc 


aagcaaaagc 


attcactaac 


cccctttcct 


gttttcctaa 


tcagcccggc 


1320 


atttcgcggg 


cgatattttc 


acagctattt 


caggagttca 


gccatgaacg 


cttattacat 


1380 


tcaggatcgt 


cttgaggctc 


agagctgggc 


gcgtcactac 


cagcagctcg 


cccgtgaaga 


1440 


gaaagaggca 


gaactggcag 


acgacatgga 


aaaaggcctg 


ccccagcacc 


tgtttgaatc 


1500 


gctatgcatc 


gatcatttgc 


aacgccacgg 


ggccagcaaa 


aaatccatta 


cccgtgcgtt 


1560 


tgatgacgat 


gttgagtttc 


aggagcgcat 


ggcagaacac 


atccggtaca 


tggttgaaac 


1620 


cattgctcac 


caccaggttg 


atattgattc 


agaggtataa 


aacgaatgag 


tactgcactc 


1680 


gcaacgctgg 


ctgggaagct 


ggctgaacgt 


gtcggcatgg 


attctgtcga 


cccacaggaa 


1740 


ctgatcacca 


ctcttcgcca 


gacggcattt 


aaaggtgatg 


ccagcgatgc 


gcagttcatc 


1800 


gcattactga 


tcgttgccaa 


ccagtacggc 


cttaatccgt 


ggacgaaaga 


aatttacgcc 


1860 


tttcctgata 


agcagaatgg 


catcgttccg 


gtggtgggcg 


ttgatggctg 


gtcccgcatc 


1920 


atcaatgaaa 


accagcagtt 


tgatggcatg 


gactttgagc 


aggacaatga 


atcctgtaca 


1980 


tgccggattt 


accgcaagga 


ccgtaatcat 


ccgatctgcg 


ttaccgaatg 


gatggatgaa 


2040 


tgccgccgcg 


aaccattcaa 


aactcgcgaa 


ggcagagaaa 


tcacggggcc 


gtggcagtcg 


2100 


catcccaaac 


ggatgttacg 


tcataaagcc 


atgattcagt 


gtgcccgtct 


ggccttcgga 


2160 


tttgctggta 


tctatgacaa 


ggatgaagcc 


gagcgcattg 


tcgaaaatac 


tgcatacact 


2220 


gcagaacgtc 


agccggaacg 


cgacatcact 


ccggttaacg 


atgaaaccat 


gcaggagatt 


2280 


aacactctgc 


tgatcgccct 


ggataaaaca 


tgggatgacg 


acttattgcc 


gctctgttcc 


2340 


cagatatttc 


gccgcgacat 


tcgtgcatcg 


tcagaactga 


cacaggccga 


agcagtaaaa 


2400 


gctcttggat 


tcctgaaaca 


gaaagccgca 


gagcagaagg 


tggcagcatg 


acaccggaca 


2460 
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XdXLXXdLyd 
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XCXCXXCdCX 
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dddy XX xxxd 


•3 ^ ^ -3 -3 "3 /-i -3 +- 

dxxdddy y d L 


LXXLydXLLX 


49nn 


L.dLdyLLl.Ll> 


y XL,dXL.dy (.X 


v.Li.LyyLLy(. 
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yxxxxxxxxx 


4^7n 

HD /LKJ 


tgtagggttt 


tcaatcgtgg 


ggttgagtag 


tgccacacag 


cataaaatta 


gcttggtttc 


4380 


atgctccgtt 


aagtcatagc 


gactaatcgc 


tagttcattt 


gctttgaaaa 


caactaattc 


4440 


agacatacat 


ctcaattggt 


ctaggtgatt 


ttaatcacta 


taccaattga 


gatgggctag 


4500 


tcaatgataa 


ttactagtcc 


ttttcctttg 


agttgtgggt atctgtaaat 
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tctgctagac 


4560 



ctttgctgga 


aaacttgtaa 


attctgctag 


accctctgta 


aattccgcta 


gacctttgtg 


4620 


tgtttttttt 


gtttatattc 


aagtggttat 


aatttataga 


ataaagaaag 


aataaaaaaa 


4680 


gataaaaaga 


atagatccca 


gccctgtgta 


taactcacta 


ctttagtcag 


ttccgcagta 


4740 


ttacaaaagg 


atgtcgcaaa 


cgctgtttgc 


tcctctacaa 


aacagacctt 


aaaaccctaa 


4800 


aggcttaagt 


agcaccctcg 


caagctcggt 


tgcggccgca 


atcgggcaaa 


tcgctgaata 


4860 


ttccttttgt 


ctccgaccat 


caggcacctg 


agtcgctgtc 


tttttcgtga 


cattcagttc 


4920 


gctgcgctca 


cggctctggc 


agtgaatggg 


ggtaaatggc 


actacaggcg 


ccttttatgg 


4980 


attcatgcaa 


ggaaactacc 


cataatacaa 


gaaaagcccg 


tcacgggctt 


ctcagggcgt 


5040 


tttatggcgg 


gtctgctatg 


tggtgctatc 


tgactttttg 


ctgttcagca 


gttcctgccc 


5100 


tctgattttc 


cagtctgacc 


acttcggatt 


atcccgtgac 


aggtcattca 


gactggctaa 


5160 


tgcacccagt 


aaggcagcgg 


tatcatcaac 


ggggtctgac 


gctcagtgga 


acgaaaactc 


5220 


acgttaaggg 


attttggtca 


tgagattatc 


aaaaaggatc 


ttcacctaga 


tccttttaaa 


5280 


ttaaaaatga 


agttttaaat 


caatctaaag 


tatatatgag 


taaacttggt 


ctgacagtta 


5340 


ccaatgctta 


atcagtgagg 


cacctatctc 


agcgatctgt 


ctatttcgtt 


catccatagt 


5400 


tgcctgactc 


cccgtcgtgt 


agataactac 


gatacgggag 


ggcttaccat 


ctggccccag 


5460 


tgctgcaatg 


ataccgcgag 


acccacgctc 


accggctcca 


gatttatcag 


caataaacca 


5520 


gccagccgga 


agggccgagc 


gcagaagtgg 


tcctgcaact 


ttatccgcct 


ccatccagtc 


5580 


tattaattgt 


tgccgggaag 


ctagagtaag 


tagttcgcca 


gttaatagtt 


tgcgcaacgt 


5640 


tgttgccatt 


gctacaggca 


tcgtggtgtc 


acgctcgtcg 


tttggtatgg 


cttcattcag 


5700 


ctccggttcc 


caacgatcaa 


ggcgagttac 


atgatccccc 


atgttgtgca 


aaaaagcggt 


5760 


tagctccttc 


ggtcctccga 


tcgttgtcag 


aagtaagttg 


gccgcagtgt 


tatcactcat 


5820 


ggttatggca 


gcactgcata 


attctcttac 


tgtcatgcca 


tccgtaagat 


gcttttctgt 


5880 


gactggtgag 


tactcaacca 


agtcattctg 


agaatagtgt 


atgcggcgac 


cgagttgctc 


5940 


ttgcccggcg 


tcaatacggg 


ataataccgc 


gccacatagc 


agaactttaa 


aagtgctcat 


6000 


cattggaaaa 


cgttcttcgg 


ggcgaaaact 


ctcaaggatc 


ttaccgctgt 


tgagatccag 


6060 


ttcgatgtaa 


cccactcgtg 


cacccaactg 


atcttcagca 


tcttttactt 


tcaccagcgt 


6120 


ttctgggtga 


gcaaaaacag 


gaaggcaaaa 


tgccgcaaaa 


aagggaataa 


gggcgacacg 


6180 


gaaatgttga 


atactcatac 


tcttcctttt 


tcaatattat 


tgaagcattt 


atcagggtta 


6240 


ttgtctcatg 


agcggataca 


tatttgaatg 


tatttagaaa 


aataaacaaa 


taggggttcc 


6300 


gcgcacattt 


ccccgaaaag 


tgccacctg 








6329 
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<211> 3423 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Plasmid pSUHS 
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agattgcagc 


attacacgtc 


ttgagcgatt 


gtgtaggctg 


gagctgcttc 


gaagttccta 


60 


tactttctag 


agaataggaa 


cttcggaata 


ggaacttcaa 


gatcccctca 


cgctgccgca 


120 


agcactcagg 


gcgcaagggc 


tgctaaagga 


agcggaacac 


gtagaaagcc 


agtccgcaga 


180 


aacggtgctg 


accccggatg 


aatgtcagct 


actgggctat 


ctggacaagg 


gaaaacgcaa 


240 


gcgcaaagag 


aaagcaggta 


gcttgcagtg 


ggcttacatg 


gcgatagcta 


gactgggcgg 


300 


ttttatggac 


agcaagcgaa 


ccggaattgc 


cagctggggc 


gccctctggt 


aaggttggga 


360 


agccctgcaa 


agtaaactgg 


atggctttct 


tgccgccaag 


gatctgatgg 


cgcaggggat 


420 


caagatctga 


tcaagagaca 


ggatgaggat 


cgtttcgcat 


gattgaacaa 


gatggattgc 


480 


acgcaggttc 


tccggccgct 


tgggtggaga 


ggctattcgg 


ctatgactgg 


gcacaacaga 


540 


caatcggctg 


ctctgatgcc 


gccgtgttcc 


ggctgtcagc 


gcaggggcgc 


ccggttcttt 


600 


ttgtcaagac 


cgacctgtcc 


ggtgccctga 


atgaactgca 


ggacgaggca 


gcgcggctat 


660 


cgtggctggc 


cacgacgggc 


gttccttgcg 


cagctgtgct 


cgacgttgtc 


actgaagcgg 


720 


gaagggactg 


gctgctattg 


ggcgaagtgc 


cggggcagga 


tctcctgtca 


tctcaccttg 


780 


ctcctgccga 


gaaagtatcc 


atcatggctg 


atgcaatgcg 


gcggctgcat 


acgcttgatc 


840 


cggctacctg 


cccattcgac 


caccaagcga 


aacatcgcat 


cgagcgagca 


cgtactcgga 


900 


tggaagccgg 


tcttgtcgat 


caggatgatc 


tggacgaaga 


gcatcagggg 


ctcgcgccag 


960 


ccgaactgtt 


cgccaggctc 


aaggcgcgca 


tgcccgacgg 


cgaggatctc 


gtcgtgaccc 


1020 


atggcgatgc 


ctgcttgccg 


aatatcatgg 


tggaaaatgg 


ccgcttttct 


ggattcatcg 


1080 


actgtggccg 


gctgggtgtg 


gcggaccgct 


atcaggacat 


agcgttggct 


acccgtgata 


1140 


ttgctgaaga 


gcttggcggc 


gaatgggctg 


accgcttcct 


cgtgctttac 


ggtatcgccg 


1200 


ctcccgattc 


gcagcgcatc 


gccttctatc 


gccttcttga 


cgagttcttc 


tgagcgggac 


1260 


tctggggttc 


gaaatgaccg 


accaagcgac 


gcccaacctg 


ccatcacgag 


atttcgattc 


1320 


caccgccgcc 


ttctatgaaa 


ggttgggctt 


cggaatcgtt 


ttccgggacg 


ccggctggat 


1380 


gatcctccag 


cgcggggatc 


tcatgctgga 


gttcttcgcc 


caccccagct 


tcaaaagcgc 


1440 


tctgaagttc 


ctatactttc 


tagagaatag 


gaacttcgga 


ataggaacta 


aggaggatat 


1500 


tcactataaa 


aataggcgta 


tcacgaggcc 


ctttcgtctt 


cacctcgaga 


aatcataaaa 


1560 


aatttatttg 


ctttgtgagc 


ggataacaat 


tataatagat 


tcaattgtga 


gcggataaca 


1620 


atttcacaca 


gaattcatta 


aagaggagaa 


attaactcat 


atggaccatg 


gctaattccc 


1680 


atgtcagccg 


ttaagtgttc 


ctgtgtcact 


gaaaattgct 


ttgagaggct 


ctaagggctt 


1740 


ctcagtgcgt 


tacatccctg 


gcttgttgtc 


cacaaccgtt 


aaaccttaaa 


agctttaaaa 


1800 


gccttatata 


ttcttttttt 


tcttataaaa 


cttaaaacct 


tagaggctat 


ttaagttgct 


1860 


gatttatatt 


aattttattg 


ttcaaacatg 


agagcttagt 


acgtgaaaca 


tgagagctta 


1920 


gtacgttagc 


catgagagct 


tagtacgtta 


gccatgaggg 


tttagttcgt 


taaacatgag 


1980 


agcttagtac 


gttaaacatg 


agagcttagt 


acgtgaaaca tgagagctta 
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gtacgtacta 


2040 



tcaacaggtt 


gaactgcgga 


tcttgcggcc 


gcaaaaatta 


aaaatgaagt 


tttaaatcaa 


2100 


tctaaagtat 


atatgagtaa 


acttggtctg 


acagttacca 


atgcttaatc 


agtgaggcac 


2160 


ctatctcagc 


gatctgtcta 


tttcgttcat 


ccatagttgc 


ctgactcccc 


gtcgtgtaga 


2220 


taactacgat 


acgggagggc 


ttaccatctg 


gccccagtgc 


tgcaatgata 


ccgcgagacc 


2280 


cacgctcacc 


ggctccagat 


ttatcagcaa 


taaaccagcc 


agccggaagg 


gccgagcgca 


2340 


gaagtggtcc 


tgcaacttta 


tccgcctcca 


tccagtctat 


taattgttgc 


cgggaagcta 


2400 


gagtaagtag 


ttcgccagtt 


aatagtttgc 


gcaacgttgt 


tgccattgct 


acaggcatcg 


2460 


tggtgtcacg 


ctcgtcgttt 


ggtatggctt 


cattcagctc 


cggttcccaa 


cgatcaaggc 


2520 


gagttacatg 


atcccccatg 


ttgtgcaaaa 


aagcggttag 


ctccttcggt 


cctccgatcg 


2580 


ttgtcagaag 


taagttggcc 


gcagtgttat 


cactcatggt 


tatggcagca 


ctgcataatt 


2640 


ctcttactgt 


catgccatcc 


gtaagatgct 


tttctgtgac 


tggtgagtac 


tcaaccaagt 


2700 


cattctgaga 


atagtgtatg 


cggcgaccga 


gttgctcttg 


cccggcgtca 


atacgggata 


2760 


ataccgcgcc 


acatagcaga 


actttaaaag 


tgctcatcat 


tggaaaacgt 


tcttcggggc 


2820 


gaaaactctc 


aaggatctta 


ccgctgttga 


gatccagttc 


gatgtaaccc 


actcgtgcac 


2880 


ccaactgatc 


ttcagcatct 


tttactttca 


ccagcgtttc 


tgggtgagca 


aaaacaggaa 


2940 


ggcaaaatgc 


cgcaaaaaag 


ggaataaggg 


cgacacggaa 


atgttgaata 


ctcatactct 


3000 


tcctttttca 


atattattga 


agcatttatc 


agggttattg 


tctcatgagc 


ggatacatat 


3060 


ttgaatgtat 


ttagaaaaat 


aaacaaatag 


gggttccgcg 


cacatttccc 


cgaaaagtgc 


3120 


cacctgcatc 


gatggccccc 


cgatggtagt 


gtggggtctc 


cccatgcgag 


agtagggaac 


3180 


tgccaggcat 


caaataaaac 


gaaaggctca 


gtcgaaagac 


tgggcctttc 


gttttatctg 


3240 


ttgtttgtcg 


gtgaacgctc 


tcctgagtag 


gacaaatccg 


ccgggagcgg 


atttgaacgt 


3300 


tgcgaagcaa 


cggcccggag 


ggtggcgggc 


aggacgcccg 


ccataaactg 


ccaggcatca 


3360 


aattaagcag 


aaggccatcc 


tgacggatgg 


cctttttgcg 


tggccagtgc 


caagcttgca 


3420 


tgc 












3423 
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